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COMSTAR Experiment: 


An Overview of the Bell Laboratories 19- and 
28-GHz COMSTAR Beacon Propagation 
Experiments 


By D. C. COX 
(Manuscript received January 10, 1978) 


Radio beacons on the COMSTAR communication satellites transmit 
continuously at 19 and 28 GHz to permit the long-term measurement 
of the properties of earth-space propagation needed in designing future 
high-capacity satellite communication systems. An extensive receiving 
facility has been established at Crawford Hill, New Jersey, for mea- 
suring attenuation, depolarization, coherence bandwidth and scatter 
of the beacon signals by atmospheric processes. The facility includes 
a precision 7-meter antenna and multichannel receiving electronics 
designed to obtain optimum benefit from the COMSTAR beacons. Other 
Bell Laboratories receiving facilities in Georgia and Illinois are accu- 
mulating statistics on signal attenuation and diversity improvements 
for other climatic conditions. 


|. INTRODUCTION 


Propagation experiments using the 19- and 28-GHz beacons on 
COMSTAR satellites represent a significant milestone in the quest for 
design information for future satellite communication systems. The 
experimental results also contribute to knowledge of meteorological 
processes. Earth-space propagation information above 10 GHz is a key 
component in the exploration of concepts for high-capacity domestic 
satellite communication systems.!2 
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Future high-capacity satellite communication systems will probably 
use frequencies above 10 GHz because of the large segment of unused 
frequency spectrum available and because of spectrum crowding at the 
lower frequencies. In particular, frequency bands over 2000 MHz wide 
are allocated at 19 and 28 GHz, and 500 MHz bands are available at 12 
and 14 GHz for common-carrier satellite communication systems; at 4 
and 6 GHz the allocated frequency bands are only 500 MHz wide. Even 
now satellite systems at 4 and 6 GHz are severely constrained by re- 
quirements for avoiding interference with the extensive terrestrial radio 
networks that share these same frequency bands. 

New systems will probably reuse frequencies on two orthogonal po- 
larizations to double the usable bandwidth and may reuse frequencies 
among spot antenna beams covering small areas of high traffic concen- 
tration. Thus, knowledge of the decrease in cross-polarization isolation 
produced by rain and ice is needed and satellite antenna sidelobe control 
will be important. Sidelobe levels of earth station antennas and the 
scattering of energy from one antenna beam to another by rain will limit 
how close communication satellites can be spaced in the geosynchronous 
orbit. Other characteristics of future systems operating above 10 GHz 
compared to present 4 and 6 GHz systems are as follows: (1) they most 
likely will use smaller earth station antennas; (ii) they will experience 
more rain attenuation and may use transmitter power control or site 
diversity to cope with it; (iz) interaction with the ionosphere will be 
significantly less; (iv) they may use wider bandwidths and thus be more 
sensitive to delay dispersion, and (v) some systems probably will use 
digital modulation. Some system calculations that were used as a guide 
in selecting representative values of experimental parameters, such as 
earth station antenna size, are presented by Tillotson in Ref. 1. 

It is well known that rain affects radio propagation more severely at 
frequencies above 10 GHz.” However, present knowledge of rainstorm 
characteristics and atmospheric processes is not adequate for predicting 
all earth-space propagation characteristics from terrestrial propagation 
measurements, surface rain measurements, or radar measurements. 
Thus, the complete information needed for satellite communication 
system design can be obtained only from measurements made along 
earth-space paths. The complication and expense of placing radio sources 
in synchronous orbit has meant that continuously transmitting beacons 
were not available in the 1960s for measuring earth-space propagation 
characteristics. Advantage was taken of a natural extraterrestrial radio 
source, the sun,® and of thermal emission from rain itself? for indi- 
rectly measuring the attenuation of earth-space radio signals above 10 
GHz. The sun trackers and radiometers used for these measurements 
provided the only continuous attenuation data available in the late 1960s 
and early 1970s.° This data base has several limitations. The sun azimuth 
and elevation change continuously during the day and, of course, the sun 
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is not available during the night. Radiometer measurements of thermal 
emission from rain have a range restricted to about 12 dB because of rain 
scatter and uncertainty in effective rain temperature.? Also, these in- 
direct measurement techniques cannot provide any information on de- 
polarization or on bandwidth limitations imposed by the propagation 
medium. The NASA experimental ATS-5 and ATS-6 satellites carried 
beacons transmitting at 15 GHz and at 20 and 30 GHz, respectively. 
These beacons did not transmit continuously and were often not avail- 
able during rainstorms because of scheduling and total power constraints 
on the satellites. Therefore, continuous long term statistics could not 
be collected using these sources. In fact, availability of the ATS-5 and 
-6 beacons was so limited that a significant depolarizing phenomenon 
went undetected until continuously transmitting satellite beacons be- 
came available.® Propagation information is needed for the satellite 
bands at 12, 14, 19, and 28 GHz; however, expense, power and weight 
limitations restrict the number of satellite beacon sources that can be 
provided reasonably. Beacons transmitting within the 19- and 28-GHz 
bands were a logical choice for sources for earth-space propagation ex- 
periments because attenuation and other atmospheric interaction is 
greater at the higher frequencies, extrapolation is reasonable from 
measurements made at two frequencies, and it is safer to extrapolate 
from large numbers to small numbers. The 19- and 28-GHz beacons were 
put on the COMSTAR satellites then to satisfy the fundamental need for 
continuous long-term measurements of propagation parameters such 
as attenuation, depolarization, coherence bandwidth and differential 
phase. 

In support of this measurement effort, an extensive receiving facility 
has been established at Crawford Hill, New Jersey.’ The facility is de- 
scribed in Section III of this paper and in the next two papers.®’ of this 
issue. An interim receiving facility at Crawford Hill is described in the 
fourth paper.!° Bell Laboratories receiving facilities have been estab- 
lished near Atlanta, Georgia (Palmetto), and near Chicago, Illinois 
(Grant Park), for accumulating attenuation and diversity statistics for 
other climatic conditions. These measurements are described in the fifth 
paper.!! Other experimenters outside Bell Laboratories are also making 
use of the beacons for accumulating propagation information.!%2° This 
paper describes the COMSTAR beacons and the experimental measure- 
ments at Crawford Hill. 


ll. THE COMSTAR 19- AND 28-GHz BEACONS 


The need for measurements in different climatic regions of the U.S. 
along with the need for continuous measurements suggest that satellite 
beacons providing signals for propagation experiments should have U.S. 
coverage antennas. Such antennas provide about 30-dB gain. Beacons 
placed on operational satellites such as COMSTAR logically can make use 
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Table | — COMSTAR satellite beacon parameters 


Station-keeping tolerance <+0.1° E-W and N-S 

Antenna pointing tolerance <+0.1° 

Carrier frequencies 19.0400 GHz and 28.5600 GHz 
Variation: Diurnal <+1 X 1076 


Aging <+1 X 10-6 per year 
Maximum <+5 X 1071! per second 
Rate 
Jitter 90% of carrier power in bandwidth <+48 X 1071!° 


EIRP 
19 GHz >+52 dBm per polarization 
28 GHz >+56 dBm 
Variation*: Diurnal <+0.3dB 
Aging <+0.5 dB per year 
aximum <+0.1 dB per minute 
Rate 
Polarization 
GHz Switched between two orthogonal linear 
28 GHz paneeaienee with most nearly vertical 19-GHz 
signa 
Orientationt 4° for satellite at 128°W 


21° for satellite at 95°W 
Crosspolarized components? >32 dB below copolarized level at worst case 


within U.S. 

Polarization switch (19 GHz only) 

Rate 1000.0 Hz + 0.1 Hz 

Stability <1 X 1077 per 10 min. 

Switching time <10 us. 

Asymmetry <+£5% 
Phase modulation (28 GHz only) Coherent with carrier 

Frequency** 264.4 MHz 

Sideband level <7 dB below carrier 

* A circuit malfunction has reduced the power output of the 19 GHz beacon at 128°W 


by 2 dB since launch. 

* Polarization orientation is the angle (<45°) that the received polarization is rotated 
from vertical or horizontal at Crawford Hill. 

+ >36 dB and >41 dB below 19- and 28-GHz copolarized levels, respectively, toward 
Crawford Hill. 

** A satellite scheduled for a later launch has a modulation frequency of 528.9 MHz. 


of the excess power generated by solar cells at the beginning of satellite 
life. Since this power is limited, low beacon power is desirable; for 
COMSTAR, about +30 dBm each from the 19- and 28-GHz beacons 
could be produced without having a detrimental effect on the primary 
communication mission of the satellite. Since a reliable measuring range 
of over 30 GB is desirable with simple, relatively inexpensive earth sta- 
tions and since a range of 60 to 70 dB at 30 GHz is desirable for extrap- 
olating attenuation data to lower frequencies, receiving system signal 
margin must be obtained by narrowing receiver noise bandwidth. Nar- 
row-band measuring systems, however, require very stable oscillators. 
These considerations, along with the needs for measuring depolarization 
and delay dispersion, resulted in the COMSTAR beacons with parameters 
summarized in Table I. 

The spin stabilized COMSTAR satellites illustrated in Fig. 1 were built 
by Hughes Aircraft Corporation and are owned and controlled in orbit 
by COMSAT General Corporation. They are leased to the AT&T and 
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cadet? ANTENNA 


ANTENNA 
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SECTION 


SPINNING=——— 
SECTION 


Fig. 1—COMSTAR satellite. Lower solar-cell-covered cylindrical portion is 9 feet high 
by 8 feet diameter. Overall height is 20 feet. 


GT&E Companies for domestic U.S. communication service.!? The lower 
half of the solar-cell-covered drum spins at between 50 and 60 revolu- 
tions/minute while the upper half, which supports the antenna platform, 
is despun to keep the antennas pointed towards the earth. The two large 
antennas are used by the 4- and 6-GHz communications system, one for 
each of two orthogonal linear polarizations. The 19- and 28-GHz beacon 
antennas are on opposite sides of the satellite, just below the commu- 
nication antennas. 

There are to be three beacon-equipped COMSTAR satellites in syn- 
chronous orbit. The first satellite was launched in May 1976 and is at 
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Fig. 2—28-GHz beacon coverage from satellite at 95° W longitude. Maximum antenna gain is 30 dB. Coverage for 19-GHz beacon is similar. 


128°W longitude; the second was launched in July and is at 95° W; the 
third is to be launched in the spring of 1978. 

The beacons were built by COMSAT Laboratories and are described 
in more detail in Ref. 13. The 19.04- and 28.56-GHz beacon signals and 
the 264.4-MHz signal that phase-modulates the 28.56-GHz signal are 
derived from a common 132.2-MHz quartz crystal oscillator. A common 
frequency multiplier chain multiplies the oscillator output to 2.38 GHz. 
The signal path splits into separate frequency multipliers after ampli- 
fication at 2.38 GHz. The 28.56-GHz multiplier output is phase-modu- 
lated before it and the 19-GHz multiplier output are amplified in sepa- 
rate negative-resistance IMPATT amplifiers. Polarization is switched 
(19 GHz only) with PIN diodes driven from a separate crystal oscillator 
and frequency divider. The beacons make use of the surplus dc power 
capacity of the solar-cell array that is available until the solar cells de- 
terioriate under the radiation environment of space. The projected 
availability of this surplus power is over two years (satellite lifetime is 
greater than seven years). The expected lifetime of the all-solid-state 
beacons is considerably greater than 7 years; however, due to a compo- 
nent failure the 19-GHZ beacon on the 128°W satellite has experienced 
regular daily power fluctuations since July 1976. 

As shown in Fig. 1, a two-horn antenna array transmits the linearly 
polarized 28-GHz signal. Another two-horn array transmits two or- 
thogonal linearly polarized 19-GHz signals. These two-horn arrays 
permit control of the polarization independent of the radiation pattern; 
polarization is determined by rotation of the horn apertures relative to 
the array centerline* while the pattern is determined largely by the array 
separation, the aperture dimension perpendicular to the array centerline, 
and the orientation of the centerline. The same antenna is used for both 
19-GHz polarizations so that the received differential phase will not be 
affected by angular motion of the satellite.* If two antennas with separate 
phase centers were used, unavoidable residual angular motion of the 
satellite (~0.1°) would produce differential phase shift as in an inter- 
ferometer. Although +0.1° is a small variation, it would cause up to +10° 
differential phase shift at 19 GHz between two antennas with phase 
centers placed as close together as possible. Figure 2 is a beacon an- 
tenna-coverage pattern constructed from pre-launch antenna-range 
measurements. Antenna patterns for the other frequency and polar- 
ization are similar. 


* The beacon at 128°W was designed to produce horizontal and vertical polarization at 
Crawford Hill with the satellite at 119° W. The beacon at 95° W was designed to produce 
horizontal and vertical polarization at Crawford Hill with the satellite at 129°W. Orbital 
assignments by the Federal Communications Commission (FCC) and launch scheduling 
did not permit placing of the satellites in their requested orbital locations. 

tM. J. Gans of Bell Laboratories collaborated with engineers at Hughes Aircraft in 
adapting the two-horn array for the two polarization application. 
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Fig. 3—Crawford Hill main receiving facility showing 7-meter diameter antenna and control building. 


Ill, THE CRAWFORD HILL RECEIVING FACILITY 


The Crawford Hill receiving facility was designed to obtain optimum 
benefit from the COMSTAR beacon emissions’ while also demonstrating 
techniques applicable to 19- and 28-GHz earth stations. The facility is 
located on top of Crawford Hill, New Jersey, about 50 km south-south- 
east of New York City at 40.392° north latitude and 74.187° west 
longitude. Crawford Hill is 115 meters above sea level. The main facility 
shown in Fig. 3 comprises the 7-meter diameter millimeter-wave an- 
tenna, receiving electronics, antenna pointing and data collection 
equipment, and computer programs indicated in the simplified block 
diagram in Fig. 4. The antenna, antenna feed and receiver front ends 
could function as communication system components. For example, the 
antenna size and gain are representative of the needs of typical high- 
capacity earth stations.! The antenna does not have any aperture 
blockage because of the offset geometry. This clean aperture, combined 
with a good surface and heavily tapered illumination, result in the low 
sidelobe levels that would be required for systems working with satellites 
closely spaced (~1°) in orbit. These low sidelobe levels are also needed 
in the propagation experiment for measuring the crosstalk produced by 
the scattering of radio waves by rain. The good surface, long effective 
focal length and feed design result in the low antenna cross polarization 
throughout the antenna beam needed by frequency reuse dual polar- 
ization systems and required in the experiment for measuring depolar- 
ization by raindrops and ice crystals. The simple format for obtaining 
open loop antenna pointing information from COMSAT also should prove 
useful in future system operation.!® The dual-frequency dual-polar- 
ization antenna feed uses low loss quasi-optical frequency and polar- 
ization diplexers suitable for diplexing high-power transmitters with 
sensitive receivers. The receiver front ends are low-noise broadband 
mixers and broadband IF amplifiers that provide a 1-GHz-wide channel 
suitable for system use from RF through first IF. Following the first IF, 
the receiver is narrow band and optimized to provide the sensitivity and 
stability needed for the propagation measurements. 

An interim receiving facility shown in Fig. 5 was used before com- 
pletion of the main facility. Measurements at the interim facility are 
continuing using the beacon at 128°W. From Crawford Hill the beacon 
at 95°W is at azimuth = 210.5° and elevation = 38.6°; the beacon at 
128°W is at azimuth = 244.7° and elevation = 18.5°. The main receiving 
facility is briefly described in the following sections. More detailed de- 
scriptions of this equipment and a description of the interim facility are 
contained in the following papers of this issue.®91° 
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Fig. 4—Configuration of main receiving facility at Crawford Hill comprising 7-meter antenna, receiving electronics, data collection subsystem and 
antenna pointing components described in the text. Facility simutaneously records COMSTAR beacon signal parameters for two polarizations at 19 
and 28 GHz and signals scattered off-axis from the main beam at 19 and 28 GHz. 
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Fig. 5—Crawford Hill interim receiving facility showing 3.7-meter-diameter antenna 
and instrumentation building. The facility described in Ref. 10 receives beacon signals 


on two polarizations at 19 GHz. 








3.1 7-meter antenna and antenna pointing 


The 7-meter diameter millimeter wave antenna® shown in Fig. 3 is of 
novel design and extreme precision. It is designed to operate in the high 
winds and generally bad weather that are characteristic of summer 
thunderstorms (and hurricanes). These bad weather events produce 
most of the propagation degradation such as attenuation and depolar- 
ization that are measured in the continuing propagation experiments. 
The antenna and pointing equipment are shared with radio astronomers 
who use the facility as a millimeter wave (100 GHz and above) radio- 
telescope. This joint use is compatible because of the inherent require- 
ments of the two types of experiments. Radio astronomy observations 
require clear atmospheric conditions that have minimum effect on 
millimeter wave propagation. These conditions, of course, produce little 
effect on the satellite beacon signals and thus no useful propagation data. 
When atmospheric conditions cause propagation degradation that 
should be measured, the conditions are not suitable for radio astronomy 
observations. 

The millimeter wave offset Cassegrainian antenna consists of a 7- 
meter diameter parabolic reflector on an altazimuth mount; a convex 
hyperbolic subreflector; dual-frequency (19 and 28 GHz) dual-polar- 
ization quasi-optical main-beam feeds; dual-frequency (19 and 28 GHz) 
single-polarization off-axis feeds; drive motors, angle encoders, and 
pointing servos; two equipment rooms; and feeds at other frequencies 
for radio astronomy. The antenna does not have a radome cover. 

All 19- and 28-GHz feeds are located in a small equipment room, the 
vertex cab, at the Cassegrainian feed point near the vertex of the main 
reflector, above both the azimuth and elevation axes. The main beam 
feeds receive simultaneously two linear orthogonal polarizations at both 
19.04 GHz and 28.56 GHz; these feeds are mounted on a single frame that 
permits rotation of the feed polarization about the coaxial main beam 
axis without affecting polarization orthogonality or beam pointing. The 
off-axis feeds form a beam that is pointed away from the axis of the main 
beam. The 0.002-inch Mylar feed window in front of the vertex cab is 
covered with a nonwetting coating and is nearly vertical to prevent water 
from collecting on it. The second, somewhat larger, equipment room, 
the side cab, is to one side of the antenna, on the elevation axis but above 
the azimuth axis. The antenna has the following characteristics: 


Gain 19 GHz 61 dB 
28 GHz 64 dB 
Beamwidth: 19 GHz 0.17° 
28 GHz 0.11° 
Cross-polarization >35 dB throughout main beam 
isolation: 
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Sidelobes: <—40 dB at >1° off-axis 


Pointing error: <0.0028° 20 mph steady wind 
<0.017° 45 mph steady wind + 
gusts to 60 mph 
0.001° angle readout 
Slew rates: 2°/sec azimuth; warm weather 


1°/sec elevation 


The antenna is fully steerable by the drive system and servo over an ele- 
vation of 0° to 90° and an azimuth of 0° to 450° with the azimuth 
rotation limited by the wrap-up of cables (no slip rings are used). 

The antenna is pointed by a minicomputer that compares angle en- 
coder outputs with command pointing angles and calculates velocity 
commands for the drive system from the resulting error. The com- 
manded pointing angles for the COMSTAR satellites are calculated by 
the computer from parameters derived from orbital predictions.!& 
COMSAT General supplies via teletype a set of coefficients for equations 
describing antenna azimuth and elevation angles at Crawford Hill. These 
equations result in a pointing error of less than +0.008° when updated 
every 2 weeks, assuming perfect orbital prediction. The antenna follows 
the +0.1° diurnal motion of the satellite within about +0.01°. 


3.2 Receiving electronics 


The propagation experiments placed strong demands on technology 
for the receiving electronics.? Continuous unattended operation is re- 
quired so that all significant weather events are included in the resulting 
data base; thus, a very high degree of reliability is necessary and rapid 
automatic reacquisition of the beacon signal after dropout due to severe 
attenuation or momentary power outage is essential. Since relative 
phases of the many signal components must be precisely measured, the 
phase stability of all circuits and components demanded careful atten- 
tion. Also, circuit arrangements had to be devised to ensure that signals 
to be compared in phase traverse a common path through high-gain 
amplifiers and other phase-sensitive equipment. 

In order to obtain the maximum possible measuring range using the 
modest power radiated by the satellite beacons, very narrow receiver 
noise bandwidths are required. This in turn requires excellent stability 
in the source oscillators in the satellites and the local oscillators in the 
earth stations. The receiver includes an automatic frequency control 
circuit with built-in memory to facilitate reacquisition after loss of sig- 
nal.* Maximum use was made of known correlations among strong and 


* The feature also permits easy return to propagation measurements after use of the 
antenna during clear weather periods for radio astronomy observations. 


COMSTAR OVERVIEW 1243 


weak signal components to permit detection of weak cross-polarized 
signals during severe fading. 

The receiving electronics portion of Fig. 4 is subdivided to indicate 
receiving functions. This part of the facility includes the balanced mixers 
(frequency converters), oscillators, frequency multipliers, IF amplifiers, 
bandpass filters, switches, and envelope (amplitude) and phase detectors 
required to process the following signals: (i) the nearly vertically and 
horizontally polarized 19-GHz main beam signals, 19V and 19H, (ii) the 
corresponding crosspolarized 19-GHz main-beam signal components, 
19XV and 19XH, (iii) the 19- and 28-GHz off-axis beam signals, 190A 
and 280A, (iv) the nearly vertically polarized 28-GHz main-beam carrier, 
28VC, upper sideband, 28VU, and lower sideband, 28VL, and (v) the 
corresponding cross-polarized 28-GHz main-beam carrier, 28XV. In- 
cluded in these functional blocks are the circuits for automatic frequency 
control, frequency acquisition, polarization switch synchronization and 
receiving system calibration. The receiving electronics are distributed 
among the two equipment rooms on the 7-meter antenna and the control 
building about 15 meters away from the antenna. The equipment dis- 
tribution optimizes noise performance and phase and amplitude stability 

_while staying within space limitations in the various equipment rooms. 
Since power line transients and momentary power outages are expected 
during heavy rain, all oscillators, filter stabilizing ovens and frequency 
memory registers are powered by batteries charged continuously from 
the power line. The receiving electronics have the following character- 
istics: 


19 GHz 28 GHz 
V,H XV,XH,OA VC,XV VU,VL OA 
Noise figure <7dB <7dB 


Noise bandwidth 16Hz 16Hzand 24Hzand 24Hz 24Hz 
1.6 Hz 2.4 Hz 


Channel-to-chan- >65 dB >68 dB 
nel isolation 

Amplitude <0.5 dB <0.5 dB 
instability 

Phase instability <2° <5° 

Frequency 2 Hz/sec 3 Hz/sec 


tracking rate 


Since the 19- and 28-GHz beacon signals are derived from a common 
oscillator, they have the same frequency fluctuations. Thus, extended 
measuring range is provided in the 28-GHz channels and in 19-GHz 
low-signal channels (off-axis and cross-polarization) by: (1) using com- 
mon frequency sources for corresponding 28-GHz and 19-GHz receiver 
local oscillators (LOs), (ii) tracking out frequency fluctuations in the 
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Table Il — Radio link parameters for Crawford Hill, New Jersey 


19.04 GHz 28.56 GHz 

Beacon (95°W) output power +27.7 dBm +30.6 dBm 
Polarization switching —3 — 
Satellite antenna gain +28.6 dB +28.2 dB 
EIRP per polarization (average) +53.3 dBm +58.8 dBm 
Path loss 22,300 miles —209.7 dB —213.2 dB 
Crawford Hill antenna gain +61 dB +64 dB 
Clear air absorption (Og and H,O) —0.6 dB —0.9 dB 
Signal into receiver (avg. per 

polarization) —96 dBm —91 dBm 
Noise into receiver (clear air) -—156dBm -166dBm -154dBm -—164dBm 

on een (16 Hz) (1.6 Hz) (24 Hz) (2.4 Hz) 

NF 


Receiver carrier-to-noise (C/N) +60 dB +70 dB +63 dB +73 dB 
(clear air with noise blanking) (16 Hz) (1.6 Hz) (24 Hz) (2.4 Hz) 
Oscillator stability 


[PLL off (below thresh.)] —0.2 dB — = — 
Measurement range (in rain) 
(to C = N) (ant. temp. 273°) 59 dB 69 dB 62 dB 72 dB 


* A single complex pole pair bandpass filter with 10-Hz 3-dB bandwidth has a noise 
bandwidth of 16 Hz. 


beacon and in LOs with a common oscillator in a loop locked in phase to 
the 19-GHz vertically polarized signal, the signal that experiences the 
least attenuation, and (iii) using very narrow-band filters in the extended 
range channels. 

Differential amplitude and phase stability is maintained by carefully 
controlling differential temperature between corresponding components 
in different receiver channels, by using low temperature coefficient 
components, by choice of IF frequencies and filter bandwidths, and by 
designing for good circuit linearity. 


3.3 Radio link parameters 


Radio link parameters affecting dynamic measuring range are sum- 
marized in Table II for the beacon at 95°W and the Crawford Hill 7- 
meter antenna and receiving electronics. The major contributors to the 
19-GHz parameters are illustrated in Fig. 6. 

Filters with 1.6-, 24-, and 2.4-Hz noise bandwidths are in the channels 
that receive signals attenuated by rain to lower levels than the vertically 
polarized (V) 19-GHz signal. The 1-dB difference between clear air 
carrier-to-noise ratio and measuring range reflects the difference in 
antenna temperature between clear air and rain. 

Refraction in the atmosphere due to temperature and humidity gra- 
dients does not have a significant effect on the experiment.!4 For ex- 
ample, the maximum expected surface refractive index variation at 
Crawford Hill is <445 ppm for August, the month of maximum spread 
in refractive index. This refractivity variation corresponds to a £0.003° 
elevation angle variation for the 38.6° elevation from Crawford Hill to 


COMSTAR OVERVIEW 1245 


TOH ee 










19 GHz 
BEACON 
+28 dBm 






EIRP 


CARRIER FILTERS 









1.6 Hz +70 dB C/N 
r- NOISE F#o-—————— > 
1 BW 
| 
SIGNAL POLARIZATION 
_ m 
SWITCH \ eicie 
0 0 NOISE 
NOISE —3 dB BW +60 dB C/N 
—156 dBm/16 Hz 
NOISE SIGNAL—3 dB 


Fig. 6—Radio link summary indicating major parameters that determine the carrier-to- 
noise ratio of the 19-GHz COMSTAR beacon signals received in clear air with the Crawford 
Hill main receiving facility. 


the satellite at 95°W. The elevation variation would be <+0.008° for the 
18.5° elevation to the 128°W satellite. Such angular variations produce 
<+0.05 dB amplitude variation for the 7-meter antenna and insignificant 
change in differential phase and cross polarization. 

Faraday rotation of linear polarization by the ionosphere is insignif- 
icant at 19 and 28 GHz.!415 The maximum expected variation in po- 
larization angle is <£0.15° for the extremes of solar flares or magnetic 
storms. During normal ionospheric conditions the variation is signifi- 
cantly less. If the cross-polarization of the instrumentation system 
(antennas, beacon polarization switch and receiver) were zero, the 
rotation variation of +0.15° would produce a cross-polarized component 
of <—50 dB. Cross-polarization variation of this magnitude is insignif- 
icant in the experiment. 


3.4 Data collection 

The data collection equipment is common to all receiving channels 
as indicated in Fig. 4. Data that are critical for maintaining continuity 
in the data base for long term statistics, e.g., signal attenuation and de- 
polarization, are recorded continuously on analog ink-pen paper-chart 
recorders. These chart recordings provide a backup in the event of failure 
of the digital recording system and also provide a “quick look” at the 
recorded data. The logarithms of signal amplitudes are recorded on the 
chart recorders with a range of 50 dB. 
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All receiver outputs are multiplexed along with (1) system status in- 
dicators such as whether the frequency control loop is tracking or holding 
in a signal fade, (ii) outputs from weather instruments such as rain 
gauges, thermometers and wind speed recorders, (iii) outputs from the 
on-going interim experiment!® using the COMSTAR beacon at 128°W, 
and (iv) another propagation experiment!” using a 12-GHz beacon on 
the NASA/Canadian Communications Technology Satellite (CTs). These 
multiplexed signals are digitized, temporarily stored in the mini-com- 
puter core memory, screened by the computer for relevance, and stored 
on digital magnetic tape. Multiplexer and analog-to-digital converter 
sequencing, digital data buffering and digital tape drive control are 
handled by the same minicomputer that points the receiving anten- 
na. 

The objectives of the data screening procedure* are to minimize the 
amount of superfluous data stored while not discarding any relevant 
propagation data. The screening algorithm copes with the multiplicative 
signal fluctuations caused by the atmosphere and with the additive noise 
that dominates at low signal level. 

All receiver outputs are digitized every 1/4 second and temporarily 
stored as a sample set. A running mean of these samples is accumulated 
for each channel. This running mean is compared with the mean value 
last recorded for the channel. If for any channel, the running mean value 

‘becomes different from the previous mean value by an amount greater 
than that expected because of receiver noise and atmospheric fluctua- 
tions, the running means for all channels are recorded. These recorded 
running means then become the previous mean values for testing a new 
sequence of running means that starts with the next sample set. This 
procedure detects gradual changes in received signal parameters. A test 
for rapid or impulsive change in a received signal parameter is also made 
on the individual sample sets. A data set is recorded at least once each 
minute regardless of whether there are changes in the data. 

Each data set contains the time it was recorded so the time interval 
spanned is available for further data processing. This data screening 
procedure, then, records all significant changes in data whether in- 
stantaneous or gradual, records data periodically for equipment 
checking, and within these constraints minimizes the amount of data 
recorded. 

The data handling procedures also include provisions for (i) easily 
stopping and starting data collection when the facility is used for radio 
astronomy, (iz) recovering from primary power interruption,‘ and (iit) 


* The computer programming required for data screening and storage was done by H. W. 
Arnold. 

t The computer programming required for power failure recovery and antenna pointing 
was done by R. W. Wilson. 
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recording calibration signals on the data tapes. Weather data and system 
status are recorded on the magnetic tape periodically. 


3.5 Propagation parameters measured 


The propagation parameters measured and recorded at the main re- 
ceiving facility are briefly described in this section. A more detailed 
parameter description is included in Ref. 9. The measurements are re- 
corded for all events that produce propagation irregularities. These 
events are all associated with cloudy weather; the most severe are asso- 
ciated with precipitation. Propagation statistics for continuous time 
intervals spanning at least a year are being compiled from the data. 


3.5.1 19-GHz attenuation and depolarization measurements 


The measurements made on the polarization switched signals and 
illustrated in Fig. 7 are: 


A19V = co-polarized vertical signal amplitude (TVRV) 
A19XV = cross-polarized signal amplitude coupled from vertical 
to horizontal (TVRH) 
A19H = copolarized horizontal signal amplitude (THRH) 
A19XH = cross-polarized signal amplitude coupled from horizontal 
to vertical (THRV) 
~19V-H = phase difference between vertical and horizontal signals 
(TVRV and THRH) 
@19V-XV = phase difference between vertical signal (TVRV) and its 
cross-polarized component (TVRH) 
@19H-XH = phase difference between horizontal signal (THRH) and 
its cross-polarized component (THRV) 


where TV indicates transmit vertical polarization from the satellite, TH 
indicates transmit horizontal, RV indicates receive on vertical polar- 
ization on the ground, and RH indicates receive horizontal. Attenuation 
is obtained by comparing amplitudes of attenuated signals with ampli- 
tudes of clear air signals. The measurement of ¢19V-H requires holding 
a phase reference from one polarization switch time period to the next. © 
This sets a lower bound on the switching frequency determined by the 
instabilities of both the satellite and ground station primary oscillators 
(phase references) and the desired accuracy of the phase measure- 
ment. 

Attenuation and depolarization of signals with any transmitted po- 
larization can be determined directly from these amplitude and phase 
measurements without having to assume a rain model.* This procedure 


* This capability is desirable since all positions within a U.S. coverage beam cannot have 
their polarizations set optimally with respect to their local rainfall. 
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Fig. 7—Three-dimensional diagram illustrating with different crosshatching the 
transmitter sequencing in time for the vertically (V) and horizontally (H) polarized 19-GHz 
beacon signals. Also illustrated in two different planes are the reception on V and H po- 
larizations of these signals. The actual signals received are indicated by arrows. Labels 
of measured signal parameters are as follows. A19V and A19H are copolarized 19-GHz 
signal amplitudes; AI9XV and A19XH are cross-polarized 19-GHz signal amplitudes; 
$19V-XV and $19H-XH are phase differences between co- and cross-polarized signals; 
and $19V-H is differential phase between copolarized signals. Note that the measurement 
of $19V-H requires holding a phase reference between time slots when the beacon transmits 
V and when it transmits H. 






is described in Refs. 7 and 18. These results will be useful for determining 
({) if there is a fixed polarization orientation with precipitation-related 
crosstalk low enough to permit doubling capacity by simultaneously 
using the same frequency on two polarizations, or (iz) if (7) is not possible, 
then if a sufficiently low crosstalk can be obtained simply by tracking 
the linear polarization rotation, or (iii) if amore complicated crosstalk 
minimizing technique will be required, or (iv) if crosstalk is low enough 
to permit use of orthogonal circular polarizations or non-optimally ori- . 
ented orthogonal linear polarizations for capacity doubling. Determining 
parameters for signal polarizations different from the transmitted po- 
larizations with sufficient accuracy for use in system design requires 
measurement accuracy considerably greater than is necessary for using 
the measured parameters directly in design.!8 


3.5.2 28.56-GHz attenuation and depolarization measurement 


Since only one polarization is transmitted at 28.56 GHz, only A28V, 
A28XV and $28V-XV can be measured. These amplitudes and phases 
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are similar to the 19-GHz measurements made in the transmit V time 
slot (see Fig. 7). Thus, crosstalk can be determined directly only for the 
polarization orientation of the measurement. Estimates of crosstalk for 
other polarization orientations and for other frequencies will be made 
using these direct measurements, rain models and the extensive 19 GHz 
measurements. 


3.5.3 Amplitude and delay dispersion measurements (usable bandwidth) 


The satellite transmits a carrier with frequency fo = 28.56 GHz and 
two sidebands coherently related to wo = 2afo with frequencies wy = 
[(n — 1)/n] wo and w, = [(n + 1)/n] wo. These signals can be represented 
at the earth station by s(t) = A(w)cos[wt — w7(w)] where 7(w) is the 
dispersive delay at frequency w and A(w) is the dispersive amplitude. 
If there is no dispersion, A(w) and 7(w) are constants; normalized values 
at the satellite are A(w) = 1 and 7(w) = 0. 

Two terms, A, and Ag or 7; and 7», for series representations of the 
amplitude or delay, A(w) = A, + Ayw + Aqw? +... and 7(w) = 75 + TH 
+ tow? +..., can be obtained from measurements of differential am- 
plitudes, A,, = A(w,) — A(w,) and Ag = A(wo) — A(we) and differential 
phases, ¢, = [n/(n si 1)]wy,7 (wy) — WoT (wo) = Wo [t(w) a T(wo)] and 
be = WoT (wo) — [n/(n — 1)] wet (we) = wo[T(wo) — T(we)]. 

The coefficient A, can be determined by an absolute attenuation 
measurement. The absolute delay 7, cannot be determined since the 
phase at the satellite is unknown. With only the carrier and one set of 
sidebands, no higher order dispersion coefficients can be determined. 

_ The measurement of differential phases ¢, and ¢¢ requires scaling 
of w, by n/(n + 1) to w, and of we by n/(n — 1) to w, as indicated. 

Measurement of dispersion (if it exists because of precipitation or 
index of refraction inhomogeneities) is made over the 528-MHz band- 
width (n = 108) and the 1056-MHz bandwidth (n = 54) permitted by 
the different satellites with the different sideband frequencies (+264 
MHz and £528 MHz). These measurement bandwidths are large enough 
to cover system requirements for many years since it is unlikely that 
transmission rates for individual channels within these bands will exceed 
1 gigabit/s in the near future. 


3.5.4 Rain signal-scatter measurement 


The off-axis beam of the 7-meter antenna points toward a geosta- 
tionary orbit position that is currently not occupied but could be occu- 
pied in the future. Beacon signal scattered by rain or other atmospheric 
phenomena into the off-axis beam represents a potential interference 
mechanism between the COMSTAR orbit position and the unoccupied 
orbital position along the off-axis beam. Measurement of the off-axis 
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beam signal amplitudes determines crosstalk levels and thus limitations 
on orbital spacing between satellites that may result from scattering of 
signal by the rain or other atmospheric phenomena. 


3.6 Performance 


Performance of the individual subsystems in the Crawford Hill 
propagation experiment is described in the previous sections and in the 
following papers.®? This section summarizes overall performance of the 
Crawford Hill main receiving facility working with the COMSTAR beacon 
at 95°W. 

Clear-air signal-to-noise (S/N) ratios measured on cold low-humidity 
winter days are 60 and 59 dB for 19-GHz H and V polarizations and 61 
dB for the 28-GHz carrier. These values are ratios of average signal power 
to average noise power measured at the narrow-band IF outputs. 

Figure 8 shows antenna patterns made by scanning the seven-meter 
antenna past the beacon and recording the logarithmic amplitude de- 
tector outputs. In Fig. 8, the lowest three curves, Al9XV, A19XH and 
A28XV, are the cross-polarized signals from receiver channels with 1.0-, 
1.0-, and 1.5-Hz 3-dB IF bandwidths, respectively; the upper curves, 
A19V and A28V, are the copolarized signals from receiver channels with 
10- and 15-Hz 3-dB IF bandwidths. The main beam of the A19H curve 
lies on top of the A19V curve and A19H sidelobes are all within a few dB 
of A19V sidelobes. The maximum values of cross-polarization discrim- 
ination, i.e., the cross-polarized signal level relative to the copolarized 
signal level at all points within the 3-dB beamwidths are XRV19 < 41 
dB; XRH19 < 36 dB and XRV28 < 45 dB. Differential phase ¢619V-H 
is an indicator of the accuracy of alignment of the phase centers of the 
7-meter antenna feeds. The differential phase varies <+1.2° over the 
3-dB beamwidths. Scans in planes other than the principal planes (azi- 
muth and elevation) are similar. 

These antenna pattern measurements illustrate the combined per- 
formance of the beacon antennas and the 7-meter antenna. The mea- 
surements also demonstrate the capability of the receiver to maintain 
frequency lock and polarization switch synchronization through deep 
signal fades (antenna nulls) and to measure the low cross-polarized signal 
components to levels of the order of 70 dB below the clear-air copolarized 
signal levels. 

Signal amplitudes vary less than +0.3 dB and differential phase varies 
less than +2° over several weeks. Most of this variation has a diurnal 
cycle. Day-to-day repeatability is within measurement resolution. Re- 
sidual cross-polarized signal levels change with a diurnal cycle that also 
changes slowly with time, apparently due to change in the orientation 
of the satellite with respect to the sun. Day-to-day repeatability of 
cross-polarized signal levels are within +0.2 dB. 
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(a) Elevation scan. 


Fig. 8—Crawford Hill 7-meter antenna scans of the COMSTAR beacon at 95°W made 
with the Crawford Hill main facility receiving electronics. Note that the worst first sidelobe 
is 22 dB down and that all others are at least 27 dB down. Sidelobe levels approach 40 dB 
down 1° off beam axis. Cross-polarized signals 19XH, 19XV, and 28X V remain more than 
38 dB below on axis copolarized signals throughout. Cross-polarization discrimination 
is better than 36 dB throughout the 3 dB beamwidths. Receiving electronics follow signals 
through antenna nulls. 


IV. CONCLUSIONS 


Continuous transmission at 19 and 28 GHz from the COMSTAR bea- 
cons in geosynchronous orbit are providing unique opportunities for 
gathering propagation information needed for designing future high- 
capacity satellite communication systems. An extensive receiving facility 
at Crawford Hill, New Jersey, comprising a precision 7-meter-diameter 
antenna and sophisticated receiving electronics, is providing a detailed 
look at propagation effects such as attenuation, depolarization, coherence 
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(b) Azimuth scan. 
Fig. 8 (continued) 


bandwidth and signal scatter that are usually related to atmospheric 
precipitation. This facility receives on two orthogonal linear polarizations 
at the two beacon frequencies. In clear weather when atmospheric at- 
tenuation is minimal, the signal-to-noise ratio when receiving the beacon 
at 95°W is 60 dB. Phase differences and amplitudes are measured for 
all signals and for their cross-polarized components. Cross-polarization 
isolations are >35 dB throughout the entire antenna beams to the —3-dB 
beam edges. 

Bell Laboratories receiving facilities in Georgia and Illinois are col- 
lecting information on signal attenuation and diversity in other climatic 
regions. Additional propagation information is being collected by non- 
Bell experiments at other locations. 


COMSTAR OVERVIEW 1253 


V. ACKNOWLEDGMENTS 


These COMSTAR beacon propagation experiments would not have 
been possible without the foresight, cooperation and support of a number 
of people at AT&T, at the Long Lines department of AT&T and at Bell 
Laboratories. The efforts of all these people are much appreciated by 
all of us who are now directly involved in these very successful experi- 
ments. 

EK. E. Muller very effectively coordinated the requirements of these 
experiments with COMSAT General, the overall satellite contractor, with 
Hughes Aircraft Corporation, the satellite and beacon antenna manu- 
facturer, and with COMSAT Laboratories, the beacon manufacturer. 

The contributions of many individuals concerned with particular parts 
of the experimental facilities are acknowledged in the following com- 
panion papers. H. W. Arnold was particularly involved in the overall 
design, assembly and operation of the Crawford Hill experiment. The 
efforts of H. H. Hoffman in these areas are also appreciated. The ex- 
periment was originally conceived and supported by L. C. Tillotson and 
D. C. Hogg among others. The continuing support and interest of A. A. 
Penzias and D. O. Reudink has been helpful. The backing of R. F. Latter 
and R. C. Harris of AT&T Long Lines and E. F. O’Neill of Bell Labora- 
tories is also appreciated. 


REFERENCES 


1. L. C. Tillotson, “A Model of a Domestic Satellite Communication System,” B.S.T.J., 
47, No. 10 (December 1968), pp. 2111-2137. 

2. D.C. Hogg, “Millimeter-Wave Communication Through the Atmosphere,” Science, 
159, January 5, 1968, pp. 39-46. 

. R. W. Wilson, “Sun Tracker Measurements of Attenuation by Rain at 16 and 30 GHz,” 
B.S.T.J., 48, No. 5 (May-June 1969), pp. 13883-1404. 

. R. W. Wilson, “A Three-Radiometer Path-Diversity Experiment,” B.S.T.J., 49, No. 
6 (July-August 1970), pp. 1239-1242. 

. D.C. Hogg and T. 8. Chu, “The Role of Rain in Satellite Communications,” Proc. IEEE, 
63, September 1975. 

. D.C. Cox, H. W. Arnold, and A. J. Rustako, “Some Observations of Anomalous De- 
polarization on 19 and 12 GHz Earth-Space Propagation Paths,” Radio Science, 
12, May—June 1977, pp. 485-440 and D. C. Cox and H. W. Arnold, “Preliminary 
Results from the Crawford Hill 19 GHz COMSTAR Beacon Propagation Experi- 
ment,” U.S. Nat’l Committee of International Union of Radio Science (USNC/URSI) 
meeting, October 11-15, 1976, Amherst, Massachusetts. 

. D.C. Cox, “Design of the Bell Laboratories 19 and 28 GHz Satellite Beacon Propa- 
gation Experiment,” IEEE International Conference on Communications (ICC ’74) 
Record, June 17-19, 1974, Minneapolis, Minnesota, pp. 27E1-27E5. 

. T.S. Chu, R. W. Wilson, R. W. England, D. A. Gray, and W. E. Legg, “The Crawford 
Hill 7-Meter Millimeter-Wave Antenna,” B.S.T.J., this issue, pp. 1257-1288. 

. H. W. Arnold, D. C. Cox, H. H. Hoffman, R. H. Brandt, R. P. Leck, and M. F. Wa- 
zowicz, “The 19- and 28-GHz Receiving Electronics for the Crawford Hill COMSTAR 
Beacon Propagation Experiment,” B.S.T.J., this issue, pp. 1289-1329. 

10. H. W. Arnold, D. C. Cox, and D. A. Gray, “The 19-GHz Receiving System for an Interim 

COMSTAR Beacon Propagation Experiment at Crawford Hill,” B.S.T.J., this 
issue, pp. 1331-1339. 
11. N.F. Dinn and G. A. Zimmerman, “COMSTAR Beacon Receiver Diversity Experiment,” 


- BS.TJ., this issue, pp. 1841-1367. 


oo 


Oo oa 


~] 


Oo oc 


1254 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


12. AT&T Company Application to FCC for a Domestic Satellite Communications System, 
March 1, 1973 and papers in COMSAT Technical Review, 7, No. 1, Spring 1977: “The 
COMSTAR Program” by R. Briskman, and “The COMSTAR Satellite System” by 
G. Abu-Taleb, M. Kim, K. Manning, J. Phiel, Jr., and L. Westerlund. 

13. Papers in COMSAT Technical Review, Vol. 7, No. 1, Spring 1977: “Centimeter Wave 
Beacons for the COMSTAR Satellites” by L. Pollack, “Centimeter Wave Beacon 
Transmitter Design” by W. J. Getsinger, “19- and 28-GHz IMPATT Amplifiers” 
by M. J. Barrett, and ““Low-Jitter Oscillator Source for the COMSTAR Beacon” by 
R. E. Stegens. 

14. D. C. Cox, unpublished notes based on Ref. 15 and on B. R. Bean and E. J. Dutton, 
“Radio Meterology,” National Bureau of Standards Monograph 92, U.S. Govern- 
ment Printing Office, Washington, D.C., March 1, 1966. 

15. A. A. M. Saleh, unpublished notes based on J. A. Ratcliffe, “The Magneto-Ionic 
Theory,” University Press, Cambrdige, 1961; K. G. Budden, “Radio Waves in the 
Ionosphere,” University Press, Cambridge, 1961; J. M. Kelso, ‘“‘“Radio Ray Propa- 
gation in the Ionosphere,” McGraw-Hill, New York, 1964; and K. Davies, “Iono- 
spheric Radio Propagation,” U.S. National Bureau of Standards, Monograph 80, 
April 1, 1965. 

16. V. J. Slabinski, “Expressions for Time-Varying Topocentric Directions of a Geo- 
Stationary Satellite,” COMSAT Technical Review, 5, No. 1 (Spring 1975), pp 

—14, 

17. A.J. Rustako, Jr., “An Earth-Space Propagation Measurement at Crawford Hill Using 
the 12-GHz CTs Satellite Beacon,” B.S.T.J., this issue, pp. 1431-1448. 

18. D. C. Cox, “Some Effects of Measurement Errors on Rain Depolarization Experi- 
ments,” B.S.T.J., 54, No. 2 (February 1975), pp. 435-450. 

19. R. B. Briskman, R. F. Latter and E. E. Muller, “Call for Help,” IEEE Spectrum, 11, 
October 1974, pp 35-36. 

20. E. E. Muller, “Notes on the COMSTAR Experiment,” B.S.T.J., this issue, pp. 1369— 
1370. 


COMSTAR OVERVIEW 1255 


Copyright © 1978 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 57, No. 5, May—June 1978 
Printed in U.S.A. 


COMSTAR Experiment: 


The Crawford Hill 7-Meter Millimeter Wave 
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A 7-meter offset Cassegrainian antenna with a precise surface has 
been built and tested. Measurements using a terrestrial source were 
made and compared with calculations for 19, 28.5, and 99.5 GHz. Low 
sidelobe level (S —40 dB) at one degree off the main beam and low cross 
polarization (< —40 dB) throughout the main beam are achieved using 
a quasi-optical 19/28.5-GHz feed system that also demonstrates very 
low multiplexing loss (~0.1 dB). The prime-focus gain measurement 
at 99.5 GHz found the difference between the measured and calculated 
gains to be (0.79 + 0.45) dB, which is consistent with the expected rms 
surface error (~0.1 mm). Multiple-beam operation accommodates both 
propagation experiments with the COMSTAR beacons at 19 and 28.5 
GHz and millimeter wave radio astronomy observations without 
physical disturbance of equipment. 


I. INTRODUCTION 


The Crawford Hill 7-meter antenna (Fig. 1) was built for propagation 
measurements with the COMSTAR beacons at 19 and 28.5 GHz, and for 
radio astronomy at frequencies from 70 to 300 GHz. It demonstrates that 
low sidelobes and high cross-polarization rejection can be obtained in 
an earth station antenna serving several satellites simultaneously. The 
antenna will also serve as a test bed for future propagation and antenna 
measurements. 

The main part of this paper is contained in the following three sections. 
Section IT describes the 7-meter antenna starting with the initial re- 
quirements. Section III gives a detailed description of the 19/28.5-GHz 
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_ Fig. 1—The 7-meter offset Cassegrainian millimeter wave antenna. The subreflector 
is mounted below antenna aperture, and key structural parts are covered with insula- 
tion. 


feed system for the COMSTAR beacon experiment. Section IV describes 
measurements of the completed antenna at 19, 28.5, and 99.5 GHz using 
terrestrial sources on a tower at a distance of 11 km. 


il. DESCRIPTION 


2.1 Requirements 


The major requirements imposed by the satellite beacon experiment 
on the 7-m antenna are for cross polarization to be less than —35 dB 
within the main beam, sidelobes to be below —40 dB at 1 degree or more 
from the main beam, multi-beam operation to be possible, and perfor- 
mance in all weather to be good, especially during summer thunder 
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showers. The attainment of at least minimal performance at the high 
end of the radio astronomy band further requires a surface accuracy of 
0.1-mm rms and a maximum pointing error of 10” arc. The expectation 
of performing beam-feed experiments and of possibly accommodating 
large cryogenically cooled radio astronomy receivers made it desirable 
to provide for a Nesmyth focus (feed along the elevation axis) in addition 
to the Cassegrainian focus. 

The requirements for low sidelobes imposes severe limitations on the 
amount of aperture blocking that can be tolerated.! Thus, an offset 
paraboloid was chosen with almost no aperture blockage. This can be 
made compatible with the cross-polarization and multibeam require- 
ments by choosing a sufficiently large secondary focal ratio.?:34 An ad- 
ditional benefit of the offset design is a large return loss. The low sidelobe 
requirements also place restrictions on the allowable surface errors. The 
magnitude of the allowable errors depends on their correlation lengths 
and the angles at which sidelobes can be tolerated, but a tolerance of 
about 0.01) is needed to avoid degradation of the far sidelobes.® Thus 
a surface tolerance of 0.1-mm rms was chosen to simultaneously satisfy 
the sidelobe requirements at 30 GHz and allow radio astronomical op- 
eration in the 200 to 300 GHz band. 

The main reflector has a focal length of 6.5659 m. It is offset such that 
its bottom is 1.2074 m above the reflector axis. The subreflector is a 1.2-m 
by 1.8-m oval portion of a hyperboloid offset 8.258 cm above its axis (Fig. 
2). It has focal lengths of 0.8697 m and 5.2262 m giving a magnification 
ratio of 6.01.'The central ray from the feedhorn to the subreflector makes 
an angle of 6.828 degrees with the axis and the illumination cone has a 
half angle of 5.06 degrees. The main reflector is subtended by a circular 
cone of 26.75-degree half angle, and the axis of the cone makes an offset 
angle of 37.26 degrees with the axis of the main reflector. The geometry 
results in a blocking of 4.4 cm of the main reflector by the subreflector 
and an expected contribution to the cross-polarization level of —56 dB 
in the main beam region, well below the required level. 

The narrow beam pattern required for the feed is the main price which 
must be paid for the advantages of a large effective F/D ratio. Small- 
cone-angle corrugated horns have been rejected for both radio astronomy 
and propagation feeds because of their excessive length. In the radio 
astronomy feed, cryogenically cooled receivers are in use, and the best 
performance is obtained by cooling the feedhorn with the receiver and 
radiationally coupling out of the Dewar. Thus, a small-cone-angle horn 
would not only be a large mass to cool, but a large low-loss Dewar window 
would be difficult to make. Instead, a quasi-optical feed system® is used 
for radio astronomy to couple a small corrugated horn in the receiver 
Dewar to the 7-meter antenna. This feed system incorporates image 
rejection, local oscillator injection, and calibration. 
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Fig. 2—The subreflector with a machined aluminum surface of 20 um rms. The oval 
shape is provided for azimuthal off-axis beams. 


In the case of the 19/28.5-GHz propagation feed, the two widely sep- 
arated frequencies would lead to problems with frequency-dependent 
phase patterns of a single corrugated feed and with higher order modes 
generated in a low-loss waveguide polarization diplexer. These problems 
were circumvented by using quasi-optical methods’ for both polarization 
and frequency diplexing as shown in Fig. 3. Four launchers, each con- 
sisting of an offset ellipsoid and a corrugated horn, are used, each for a 
single frequency and polarization. A quasi-optical frequency diplexer® 
first combines two frequencies at each polarization, then a polarization 
grid? combines and simultaneously cleans the two orthogonal polariza- 
tions. A 45-degree mirror finally reflects the combined beam to the 
subreflector, leaving adjacent areas in the focal plane available for other 
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Fig. 3—Sketch of a 19/28.5 GHz dual-polarization feed system. The beam from the 
subreflector passing through the entrance aperture is first split by the polarizer and then 
subdivided by two frequency diplexers. 


receivers. This feed system was first tested on axis, but operates in a 
position to produce a beam 0.5 degrees off-axis in azimuth, leaving the 
on-axis feed position free for the higher frequency radio astronomy feed. 
This off-axis operation results in very little degradation in the antenna 
pattern. 

Past experience has shown that, after aging, radome surfaces hold 
thick water films during rain, causing large microwave attenuation.!° 
The 7-m antenna was therefore designed without a radome, but with 
thermal insulation of critical parts of the structure to allow operation 
in the sun without significant degradation of pointing or surface accu- 
racy. The support structure affords stiffness sufficient for operation at 
30 GHz in winds up to 70 mph. 


2.2 Mechanical description 


The mount for the 7-m antenna is a conventional elevation-over-azi- 
muth design with a single 2-m diameter cross-roller azimuth bearing and 
a yoke holding the elevation bearings. The elevation moving structure 
is built around a steel box girder. One end of the girder has a 32-cm bore 
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spherical roller bearing. The other end has a 1.4-m bore radial roller 
bearing sized to provide a 1.2-m diameter path to the side cab from the 
vertex area. The surface of the box girder facing the subreflector has a 
1.6-m opening for the RF beam, completing the path from the subre- 
flector to the elevation axis. A truss girder above the box girder supports 
the nine steel trusses that are radial to the axis of the main reflector and 
support its surface panels. The center truss is the longest and has been 
made deeper than the others by connecting its outer member to the ele- 
vation wheel support structure. The stiffness of the other trusses was 
adjusted using computer calculations so that gravity and wind deflections 
are expected to introduce mainly pointing errors with only small de- 
viations of the antenna surface from its parabolic shape. 

The 27 surface panels are arranged in four approximately equal width 
rings. The panels are A356 aluminum castings containing 17-cm deep 
ribs on the back side near the edges. These are joined by 9-cm deep ribs 
running in the circumferential direction with spacings of approximately 
30 cm. The castings were tempered to T51, rough machined on a nu- 
merically controlled milling machine, stress relieved, and then machined 
to the final contour on the same machine. The manufacturer tested all 
the panels on his milling machines and found the surfaces to be accurate 
to better than 50-~m rms with an average value of about 40 um. One of 
the early panels was tested on a Portage measuring machine and was 
found to have the same rms error (37 wm) as determined by the manu- 
facturer. It was then subjected to 10 rapid temperature cycles from 
—40°C to 60°C and back. Remeasurement on the Portage machine 
showed that the surface error had increased to 50-um rms. 

After machining, the panels were cleaned and painted with a 30 to 
50-um coat of an Alkyd base, TiO2 pigment, flat white paint. After final 
alignment of the panel corners to the alignment template (see below), 
five panels near the center were measured against the template on an 
18-cm grid. One of these panels was found to have a much larger surface 
error (100-um rms) than expected with one bad area and a general warp. 
A visual inspection showed this bad area to be by far the worst on the 
antenna. However, it is only a small contribution to the total surface error 
of 100-um rms. 

The panels are held to the back-up structure by 2.54-cm diameter 
adjusting bolts. At the panel end, these bolts attach to machined pads 
in the corners of the peripheral ribs with ball and socket joints to avoid 
transmitting torques to the panels. These adjusting bolts are perpen- 
dicular to the surface and bend to take up the differential expansion 
between the aluminum panels and the steel backup structure. 

Alignment of the surface panels was accomplished using a sweep 
template (Fig. 4) for reference. This was done under a tent with the an- 
tenna in the vertical look position, before installing the subreflector 
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Fig. 4—Panel alignment template mounted in place of subreflector support tower above 
i ete renee prior to installation of tent. The template consists of two sections joined 
y a link. 


support tower and the vertex equipment room. The alignment template 
was machined in two parts, each of which has a reference hole at each 
end. The reference holes at the outer end of the inner template and the 
inner end of the outer template are at the same height and were joined 
by an accurate link. A counterweighted boom built on an adjustable 
vertical bearing supported the template. Another link adjusted the inner 
hole of the inner template to the correct radius from the rotational axis. 
Before each use, the vertical axis was set using an electronic tiltmeter, 
and the four template reference holes were set to their required relative 
height using gravity-oriented links on an optical level. Twelve trans- 
ducers were attached to the template at radii corresponding to the panel 
mounting screws. They were referred to the accurate bottom surface of 
the template and used to measure panel corner positions. 

The subreflector support structure has been kept well below the axis 
of the main reflector, especially near the secondary focus. This, coupled 
with the oversized subreflector, allow beams up to 2 degrees off axis in 
azimuth or 1 degree in elevation to be launched from the vertex equip- 
ment room. The subreflector itself is a machined aluminum casting of 
the same material and temper as the panels. It was machined on a vertical 
lathe and has a measured surface accuracy of 20-um rms. 


7-METER MILLIMETER WAVE ANTENNA = 1263 


Computer calculations of thermal distortions of the structure coupled 
with temperature measurements on a test fixture showed that the sun 
shining on the structure from some angles would produce unacceptably 
large pointing and surface errors if the only thermal control were white 
paint. A strategy was adopted of insulating much of the structure and 
circulating ambient air through the critical volumes. The back of the 
surface panels has been sprayed with foam and the remainder of the main 
reflector backup structure enclosed by 5-cm-thick foam panels. A large 
blower floods this insulated volume with ambient air. The main members 
of the subreflector support structure are made of 10.16-cm-square steel 
tubing and are insulated. Ambient air is blown along them by blowers 
in the elevation girder. Most of the remaining steel structure is covered 
by 5-cm-thick panels of foam to increase thermal time constants, but 
no air circulation is provided. 

In addition to shielding the backup structure from solar heat, the in- 
sulation of the back of the main reflector surface panels reduces the heat 
flux through the panels and hence the steady-state temperature differ- 
ence between their surface and ribs. The penalty, however, is an increase 
(~2X) in the temperature rise of the surface panels in the full sun. This 
uniform temperature rise has the same effect as an absolute temperature 
change on the differential thermal expansion between the steel backup 
structure and the aluminum panels. It causes the panels to expand or 
contract in a direction parallel to the surface of the paraboloid defined 
by the backup structure. This has an almost negligible effect on the 
performance of the antenna. Warping of the panels due to thermal gra- 
dients perpendicular to their surface or thermal warping of the backup 
structure is much more important. 

Two equipment rooms are on the antenna structure (Fig. 1). The 
vertex equipment room and the hollow elevation girder behind it pro- 
vides a mounting space for receivers at the Cassegrainian focus. They 
move in elevation angle with the antenna. A double window of 50-um 
Mylar* sheet allows RF energy to pass into this area with negligible loss 
(~0.1 dB) at 100 GHz. The side cab is provided for the Nesmyth focus. 
It moves only in azimuth. At present, it contains only receiver support 
equipment and the main elevation cable wrap. An 8-m by 6-m control 
building 15 meters from the antenna contains the rest of the receivers, 
the control computer, and a work area. 

The volume under the azimuth bearing contains the azimuth cable 
wraps. A maypole wrap is used for most of the cables, but an auxiliary 
clock spring wrap carries the phase-sensitive RF cables. 

The drive for each axis uses a single bull gear with two motors con- 
nected to separate speed reducers and pinions. The amplifiers for the 
two motors are biased so that, for low torque output, they torque in op- 


* Registered trademark of E. I. Dupont. 
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posite directions to eliminate backlash, but when more than 15 percent 
of the maximum torque is required the opposing motor reverses. Each 
of the four de motors is rated at 3 horsepower with phase control exci- 
tation. The gearing is chosen to give a slew speed of 1 degree/s in eleva- 
tion and 2 degree/s in azimuth with 1 degree/s? acceleration in both axes 
and the ability to hold position or drive to the stow position in a 70-mph 
wind. This gearing results in the motor inertia dominating the antenna’s 
inertia. Each motor has a brake with the same torque rating as the motor. 
When the drive system returns to standby from an active condition, the 
brakes are set before the motors relax so that the anti-backlash preload 
of the gear system is maintained. 

The analog portion of the drive system is set up as a velocity control 
loop. Each drive motor has a tachometer generator. The sum of the ta- 
chometer signals is used in the main feedback loop, and the difference 
is used to damp possible oscillations in which the motors move in op- 
posite directions. 

Each axis has a direct drive Inductosyn* system which has an angular 
accuracy of 0.001 degree and a resolution of 21 bits. The antenna’s 
minicomputer reads the position of each axis every 10 ms, subtracts it 
from the desired position, and applies the scaled difference to the drive 
system as a velocity command. Drive system overshoots are minimized 
by compressing the gain of the feedback loop within the computer by 
a factor of 4 for command velocities greater than 14 of the maximum 
velocity. This strategy keeps the commanded velocity below the decel- 
eration limit of the drive system when approaching the final position. 
When a source is tracked that moves at the sidereal rate or slower, the 
servo error has not been observed to exceed 0.001 degree with winds up 
to approximately 50 mph. 


Ill. 19/28.5 GHz DUAL POLARIZATION FEED SYSTEM 
3.1 Quasi-optical diplexers 


The polarization diplexer (see Fig. 2) is simply a polarization grid made 
by photo-etching a copper-covered Mylar sheet. Copper strips 0.25-mm 
wide and 0.018-mm thick are spaced 0.25-mm apart on a Mylar sheet 
0.013-mm thick. The grid is mounted on an aluminum supporting frame 
with an oval-shaped aperture of 33.02 cm by 48.26 cm. To achieve a flat 
grid, the supporting frame is made of jig plate instead of regular alumi- 
num stock, which shows warping after machining. The plane of the grid 
is oriented at 45 degrees with respect to the incident beam from the 
subreflector. The conducting copper strips are perpendicular to the plane 
of incidence to avoid cross-polarized radiation? for both transmitting 
and reflecting orthogonal polarizations being diplexed. 


* Registered trademark of Farrand Industries, Inc. 
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Fig. 5—Transmission of a double-self-supported frequency diplexer (see Fig. 3) with 
the indicated dimensions of the grid mask at 6.3 mm spacing and 45-degree angle of inci- 
dence for two polarizations. 


Each of the two frequency diplexers is a pair of self-supported grids 
made of beryllium copper 75-um thick. The 6.3-mm spacing between 
the girds is maintained by mounting on opposite sides of an aluminum 
frame with an oval aperture of 30.5 cm by 45.7 cm. Figure 2 shows that 
the polarization for Feeds No. 1 and No. 3 is perpendicular to the plane 
of incidence at the frequency diplexer, while that for Feeds No. 2 and 
No. 4 is parallel to the plane of incidence. The transmission character- 
istics for both polarizations and the dimensions of this double-self- 
supported grid are given in Fig. 5, which has been taken from Arnaud 
and Pelow’s article. The measured insertion losses for both transmitting 
19.04 GHz and reflecting 28.56 GHz are less than 0.1 dB. Figure 5 also 
shows essentially perfect transmission and rejection frequency bands 
of well over 2 GHz each. This performance should be compared with a 
minimum insertion loss of 0.2 dB and a 2-GHz band loss of up to 1 dB 
for a typical waveguide diplexer at 19 and 30 GHz. 

Several alternate quasi-optical frequency diplexers were tested. For . 
a large (45-degree) angle of incidence, none of them achieved the desired 
transmission (—0.1 dB) and rejection (—20 dB) simultaneously for fre- 
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Fig. 6—The 19(28.5)-GHz conical corrugated horn and its hybrid mode launcher. 


quency bands in the ratio of 1.5 to 1. A single gridded Jerusalem-cross 
diplexer’ showed a transmission loss of 0.2 dB at 19 GHz for the polar- 
ization perpendicular to the plane of incidence and a rejection of only 
—10 dB at 28 GHz for the polarization parallel to the plane of incidence. 
When Mylar substrates were added to the double self-supported grid 
for improving its mechanical strength, transmission losses of 0.4 and 0.2 
dB were observed at 19 GHz for the perpendicular and parallel polar- 
izations. 

The mechanical resonance frequency of the quasi-optical grids has 
been measured to be around 80 Hz, which is well above the passband of 
the baseband data in the COMSTAR beacon experiment. 


3.2 Corrugated horns 


Four corrugated horns (two each at 19 and 28.5 GHz) were constructed 
to illuminate the offset elliposids. The dimensions of the 19-GHz cor- 
rugated horn are illustrated in Fig. 6, where the numbers inside the 
parentheses are essentially scaled designs for 28.5 GHz. They are 
shortened versions of a very long corrugated horn built by Dragone.!! 
The impedance matching between the smooth wall of the circular 
waveguide and the quarter wavelength slots of the corrugated horn is 
provided by a linear taper from half-wavelength corrugations that behave 
like a conducting surface. The 19-GHz transition from 1.27-cm diameter 
circular waveguide to 1.067 cm by 0.432 cm rectangular waveguide is 
hardware from the DR-18A terrestrial 18-GHz digital radio system. This 
transition consists of a tapered section from circular to square shape and 
a quarter-wavelength transformer. The 28.5-GHz transition from 


7-METER MILLIMETER WAVE ANTENNA = 1267 


———— E PLANE 


——-— HPLANE 


DECIBELS 


DEGREES 





36 24 12 0 12 24 36 
ANGLE IN DEGREES 


Fig. 7—Measured radiation patterns of 28.5-GHz corrugated horn. Rotation center 1.42 
cm behind horn aperture. 


0.846-cm diameter circular waveguide to 0.711 cm by 0.356 cm rectan- 
gular waveguide was also similarly designed. The measured return loss 
of all horns is better than 35 dB at the beacon frequencies 19 GHz and 
28.5 GHz, and remains better than 31 dB over a 2-GHz band. 

The measured far-field radiation patterns of the corrugated horns are 
shown for 28.5 GHz in Fig. 7, which are in excellent agreement with the 
calculations. The measured patterns at 19 GHz are essentially the same 
as those in Fig. 7, with the phase center located at 2.13 cm behind the 
horn aperture. The offset ellipsoid intercepts the horn radiation in the 
Fresnel region. It is difficult to measure the radiation pattern at a dis- 
tance corresponding to the ellipsoid location. However, the calculated 
20-dB half-beamwidth of the Fresnel zone radiation pattern is 2 degrees 
broader than those of the far-field patterns, and will be the same as the 
28-degree half-cone angle of the ellipsoid subtended at the focus as shown 
in Fig. 8. 


3.3 Offset ellipsoids 


To be mirror-imaged at the Cassegrainian focal region (secondary 
geometrical focus of the subreflector for the on-axis case), the common 
phase center of the four offset launchers should be located in the middle 
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Fig. 8—Schematic diagram of the 19(28.5)-GHz offset ellipsoid. 


of the feed box as shown in Fig. 9. Therefore, ellipsoids instead of para- 
boloids were used in the design of the feed system. One notes that a small 
movement of the phase center can be accomplished by the defocusing 
of a paraboloid. However, to place the phase center at a considerable 
distance in front of or behind a reflector with a small F/D ratio, an el- 
lipsoid or a hyperboloid should be used. The design of an offset launcher 
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Fig. 9—Top view of vertex equipment room which has a height of 2.3 m and moves with 
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was started with estimates by Gaussian beam approximation!2-4 and 
finalized by computer simulations which were essentially numerical 
integration of diffraction integrals,?:!5 assuming various design param- 
eters. 

Dimensions of both 19-GHz and 28.5-GHz offset ellipsoidal launchers 
are given in Fig. 8. Since the distance between the phase center and the 
28.5-GHz elliposid is much greater in wavelengths than that of 19 GHz, 
the 28.5-GHz ellipsoid has larger size and greater curvature than a scaled 
version of the 19-GHz ellipsoid. The offset ellipsoid is subtended by a 
circular cone at the focus. The offset angle of the feedhorn axis is 3 de- 
grees greater than that of the cone axis; thus approximately equal illu- 
mination taper can be achieved on the top and bottom edges of the re- 
flector. The intersection of an ellipsoid and a circular cone subtended 
at one focus is a plane ellipse subtended by another circular cone at the 
other focus. The radiating beam from the ellipsoid will lie along the latter 
cone axis which deviates from the major axis of the ellipse by 4.16 degrees 
at 19 GHz and 5.28 degrees at 28.5 GHz. Aluminum jig plates 2.54-cm 
thick are first cut into ellipses of 32.41 by 30.48 cm and 23.85 by 22.40 
cm; then they are positioned and oriented correctly with respect to el- 
lipsoidal axes for computer-controlled numerical machining. 


3.4 Mechanical mounting 


All the components are mounted in an L-shape frame of 106.7 X 106.7 
X 61 cm as illustrated schematically in Fig. 2. The frame is made of 
structural aluminum angles with sufficient diagonal bracing to insure 
rigidity without blocking the radiating beams. The mounting of corru- 
gated horns and offset ellipsoids was designed to allow adjustment for 
positioning, orientation, and focusing. . 

Owing to the uncertainty of the orbital position assignment for the 
COMSTAR satellite, the polarizations of the beacon signals were not 
precisely given. Therefore physical rotation of the feed frame around 
the beam axis is needed to match an arbitrary pair of orthogonal linear 
polarizations.* This required rotation is accomplished by a thrust ball 
bearing with 33-cm diameter circular opening in the front and a floating 
bearing in the back. These bearings are mounted on an aluminum sup- 
porting frame as shown in Fig. 10. This bearing mount is directly at- 
tached to steel I-beams through holes in the floor of the vertex equipment 
room of the 7-m antenna. The connection between the bearing mount 
and the steel I-beam allows differential thermal expansion between the 
indoor aluminum structure and the outdoor steel structure. 


* This rotation is also needed for calibrating the amplitude and phases of the cross- 
polarized signals received from the satellite beacons. 
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Fig. 10—A 19/28.5 GHz duo-polarization quasi-optical feed system (see Fig. 3) for the 
beacon measurements. 


The 45-degree flat mirror, which consists of a 0.794-cm-thick alumi- 
num jig plate, is mounted in front of the 33-cm bearing opening as shown 
in Fig. 9. Both azimuth and elevation orientations of the mirror can be 
adjusted around its center to facilitate experimental search for proper 
illumination of the subreflector. An oversized mirror of 38.10 by 53.34. 
cm oval shape was used in the initial measurents of the 7-m antenna. 
However, a smaller mirror of 30.48 by 43.18 cm shows very little trun- 
cation effect and is used in the beacon propagation experiment. 


3.5 Measured results of the feed system 


The design objective of the feed system is for each of the four offset 
launchers to illuminate the subreflector with a spherical wave of 15 dB 
taper over a 10-degree sector from a common phase center, with very low 
cross-polarized radiation as well as very low insertion loss of the quasi- 
optical diplexers. To achieve and demonstrate the desired performance, 
we aligned the components through pattern measurements in an ane- 
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Fig. 11a—Measured radiation patterns of Feed No. 1 at 19 GHz (see Fig. 3). 


choic chamber. To simulate the illumination of the 7-m offset Cassegrain 
antenna, the distance between the transmitting source horn used in the 
pattern measurements and the phase center of the receiving feed system 
was the same (526 cm) as that between the subreflector and its geo- 
metrical focus. Mechanical alignment was established before electrical 
measurements. 

The measured insertion loss for a quasi-optical polarization or fre- 
quency diplexer was found to be 0.1 dB or less in each case. Amplitude 
and phase patterns were measured with respect to the common rotation 
center and the common optical boresight. After a few iterative adjust- 
ments of orientations and locations for both corrugated horn and ellip- 
soid, good agreement between measured and calculated patterns was 
achieved for each offset launcher. In particular, the measured patterns 
verified the calculated beamwidth, the effective phase center of the offset 
ellipsoids, and the approximate pattern symmetry predicted in the 
asymmetrical offset plane. 
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Fig. 11b—Measured radiation patterns of Feed No. 3 at 28.5 GHz (see Fig. 3). 


The final measured patterns of Feeds No. 1 (19 GHz) and No. 3 (28.5 
GHz) are shown in Fig. 11 for both E and H planes. Essentially identical 
measured patterns were obtained for the orthogonally polarized pair of 
Feeds No. 2 (19 GHz) and No. 4 (28.5 GHz). Following the nomenclatures 
used for horn reflector antennas, the longitudinal plane is the asym- 
metrical offset plane which divides the ellipsoid into two symmetrical 
halves and the transverse plane is the orthogonal principal plane. The 
measured phase patterns are less than +5 degrees from a common 
spherical phase front while the measured amplitude patterns are within 
+0.5 dB of perfect coincidence over the 15-dB pattern-width illumi- 
nating a 10-degree sector with respect to the common boresight. The 
cross-polarized radiation remained below —45 dB for all directions. It 
was found necessary to cover both the transmitting horn and the front 
bearing mount of the receiving feed system with absorbers to avoid ex- 
cessive interactions. The ripples in the measured phase patterns of Fig. 
11 were identified as the effects of the residual interactions. 
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The measured phase patterns of Fig. 11 imply an alignment accuracy 
of about 499 beamwidth among the beams for the beacon receiver of the 
7-m antenna. One notes that any moderate asymmetry or misalignment 
of the amplitude pattern of the feed system, which has little effect on 
the beam-pointing accuracy, can be tolerated in a communication sys- 
tem. However, good alignment of the feed amplitude patterns is essential 
to the stability of the differential phase between the beams of the 7-m 
antenna. 

After the alignment measurements, the RF stages of the receiver!® for 
the 19- and 28.5-GHz beacon experiment were mounted on the feed 
frame. The alignment was then checked using the beacon receiving 
system. Tests were made on the thermal stability of the differential phase 
between the two orthogonally polarized 19-GHz feeds. The change of 
the differential phase with respect to temperature was about 0.1 degree 
per 1°F and could be partially explained by the difference in waveguide 
lengths. Local heating of the polarization-diplexing grid shows a 0.2- 
degree change of the differential phase. 

Since the calibration of the beacon receiver makes use of the rotation 
of the feed frame around the beam axis, the behavior of the differential 
phase during this rotation was examined. When the two orthogonal 
linearly polarized incident waves are of comparable magnitude (i.e., when 
the transmitting polarization is oriented at roughly 45 degrees with re- 
spect to the orthogonal polarizations of two 19-GHz receiving feeds), the 
measured differential phase remains essentially constant (within 0.1 
degree) over a 10-degree rotation of the feed frame around the beam axis. 
When the two orthogonal linear polarizations are of vastly different 
magnitude, the measured differential phase can involve a substantial 
error because of a phase quadrature component arising from cross-po- 
larization coupling. Even a polarization grid cannot effectively dis- 
criminate against an incident cross polarization of a magnitude much 
greater than that of the in-line polarization. 


IV. MEASUREMENTS OF THE 7-METER ANTENNA 


4.1 Transmitting sources 


To measure the gain and radiation patterns of the Crawford Hill 7-m 
antenna, a weatherproof box, which contains transmitting sources at 19, 
28.5, and 99.5 GHz was placed on an AT&T Long Lines tower at Sayre- 
ville, N.J., approximately 11 km from Crawford Hill. Two horn-lens 
antennas with polarization grids (providing cross-polarization dis- 
crimination better than 50 dB) are used for vertical and horizontal po- 
larizations at the lower frequencies. Each antenna has a frequency di- 
plexer and can transmit 19 and 28.5 GHz. Waveguide switches inde- 
pendently control the polarization or turn off each transmitter. The 
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Fig. 12—Terrain profile of boresight range for 7-meter antenna. 


antenna gains are 30 and 33 dB with half-power beamwidths of 6.5 and 
5 degrees at 19 and 28.5 GHz, respectively. The sources at 19.04 and 28.56 
GHz are 100-mw Gunn oscillators. 

An antenna consisting of two cylindrical reflectors!” with a dual-mode 
feed and a vertically polarized grid is used to transmit a 99.5-GHz signal 
from a 10-mw IMPATT source with only about 50-KHz FM noise. The 
oscillator is connected through an isolator directly to the feed horn. The 
antenna has a gain of 41.5 dB with a half-power beamwidth of 1 degree 
in the elevation plane. 

Because of the distant location, the source box is remotely controlled 
from Crawford Hill. The beams from three source antennas were initially 
co-aligned before installation on the Sayreville tower. The expected 
power received by the 7-m antenna was about —30 dBm at all three 
frequencies. 


4.2 Probing measurements of the incident field 


In evaluating measured results of a large aperture antenna, it is nec- 
essary to know the field distribution incident on the aperture from a 
distant transmitting source. The path profile of the Sayreville to 
Crawford Hill measuring range is shown in Fig. 12. Although the well- 
elevated transmitting and receiving sites provide a clear line of sight, 
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Fig. 13—Sample vertical scans for each of the five transmissions from source at Sayre- 
ville. Repeatable scans at 19 and 28.5 GHz indicate small reflections from the terrain. 
Uncorrelated scans at 99.5 GHz indicate atmospheric scintillations. 


there still exist potential reflectors and scatterers of the millimeter-wave 
energy transmitted from Sayreville. A carriage and radial track mech- 
anism was designed to permit a probe antenna to be moved along a di- 
ameter of a circular aperture to obtain field amplitude measurements. 
The incident field along four diameters of the 7-m aperture with 45- 
degree angular separation was scanned for each of the five possible states 
of transmission, i.e., vertical and horizontal polarizations at both 19 and 
28.5 GHz and vertically polarized 99.5 GHz. Each scan was made twice 
to check on repeatability. 

Figure 13 shows sample segments of scan pairs for all five transmis- 
sions. The good duplications of fluctuations at 19 and 28.5 GHz indicate 
that the systematic deviations in the field are results of specular re- 
flections and diffractions. At 99.5 GHz, very little correlation exists 
between scans on a given diameter. Here the fluctuations are mostly 
atmospheric scintillations rather than terrain reflection. One notes that 
the 99.5-GHz transmitting antenna at Sayreville has a much narrower 
beamwidth than those of lower frequencies and the electrically rougher 
terrain is much less specular. 
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The spatial variations of the scan data suggest that most of the scat- 
tering comes at large angles with respect to the transmission path. This 
observation has been indeed confirmed by angular spectra obtained from 
a Fourier transform of the data. Hence, the terrain reflections should 
be negligible in the received power of the 7-m antenna pointing directly 
toward the transmitting source. The effect on the measured patterns 
will be only minor disturbance on sidelobes in the elevation plane at 19 
and 28.5 GHz when the antenna is pointing below the source. 

The received power in a gain-standard horn, which has a gain of the 
same order as that of our probe horn, will exhibit similar fluctuations 
across the aperture as those of Fig. 13. Since the fluctuations at 99.5 GHz 
are mostly atmospheric scintillations, the uncertainty arising from this 
fluctuation can be suppressed by taking an average of a number of 
comparisons between the gain standard and the 7-m antenna. However, 
at 19 and 28.5 GHz, the fluctuations are caused by terrain reflections; 
to reduce the uncertainty here, an average needs to be taken of numerous 
horn locations over the aperture. Analysis of the data resulting from 
probing measurements indicates that scanning the horn over a 1.4-m 
aperture segment gives a standard deviation of 0.4 dB at 19 GHz and 0.34 
dB at 28.5 GHz. 


4.3 Prime focus measurements 


To provide an evaluation of surface tolerance as well as to locate the 
primary focal point of the 7-m reflector for subsequent installation of 
the subreflector, we first conducted 99.5-GHz prime focus measurements 
using a dual-mode feedhorn with 20-dB taper at the reflector boundary. 
The measured feed patterns are shown in Fig. 14. The focal region was 
probed with the feed until the best patterns were obtained. The mea- 
sured patterns in the azimuth and elevation planes are shown in Fig. 15 
together with the calculated pattern envelope assuming a perfect re- 
flector surface.2 The calculated patterns are approximately the same 
for azimuth and elevation. Expanded patterns not shown here indicated 
good agreement between measured and calculated half-power beam- 
widths (0.032 degree); however, the measured sidelobe levels are higher 
than the calculated values especially in the elevation plane. It is of in- 
terest to note that only near sidelobes in the elevation plane are higher 
than the corresponding lobes in the azimuth plane, whereas the far 
sidelobes in the two planes are essentially similar. Furthermore, the 
measured far sidelobe levels are consistent with an rms surface roughness 
of 0.1 mm. The excessive near sidelobe level in the elevation plane ap- 
pears to be caused by a surface distortion of large-scale size. Since the 
reflector panels were set using a two-section template, a relative mis- 
alignment of these two sections could cause the elevation patterns we 
observed. This conjecture has been confirmed by pattern calculations 
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Fig. 14—Measured patterns of 99.5-GHz dual mode horn (0.98-cm diameter aperture) 
for prime-focus feed. 


assuming a relative displacement (~0.15 mm) between two sections of 
the template. 

Gain measurements of 99.5 GHz for the 7-m offset reflector using 
prime focus feed were made by comparison with a calibrated gain stan- 
dard.!8 The comparisons were made at various times of day to obtain 
some estimate of the effects of scintillation. A time constant of about 
1 s was used in the measuring set to smooth out the scintillation. Mea- 
surements indicate that gain variations due to diurnal variations of 
scintillations are generally less than about 0.2 dB. This observation has 
been also confirmed by the repeatability of measured patterns. A mea- 
sured gain of 74.63 + 0.45 dB was obtained from a sample of 10 com- 
parisons with the gain standard taken on a clear quiet evening shortly 
after sunset. 

The comparisons were made by measuring the difference in signal 
received by the gain standard (30.77 dB) and the 7-m antenna padded 
by acalibrated attenuator (40.05 dB). The error estimates of 40.45 dB 
were obtained from the root-sum-square of the 3o random errors: 0.14 
dB for the calibrated attenuator, 0.16 dB for the gain standard, and 0.40 
dB for the sample mean of 10 comparisons. 

The theoretical gain of the antenna at 99.5 GHz having no roughness 
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Fig. 15—99.5-GHz scan over a 2 degree range measured with prime-focus dual-mode 
feed of 20-dB taper. The difference between measured far sidelobe levels and the calculated 
pattern envelope for perfect surface is consistent® with 0.1 mm rms surface tolerance. The 
higher near-sidelobe level in the elevation plane stems from large-scale surface distor- 
tion. 


is calculated as follows: 


77.26 dB = Area Gain 

—1.56 dB = Illumination Taper 

—0.08 dB = Spillover 

—0.2 dB = Feed Loss (Estimated) 

75.42 dB = Calculated Gain for Perfect Surface 


Using the formula e~47«/)”, where « is the rms surface tolerance, we see 
the difference between measured and calculated gains, (0.79 + 0.45) dB, 
corresponds to an rms roughness of (0.1 + 0.03) mm. 

Using a 20-dB taper corrugated horn (see Figs. 6 and 7) as a prime 
focus feed, we also measured the patterns of the 7-m offset reflector at 
28.5 GHz as shown in Fig. 16. Excellent agreement between measured 
and calculated patterns were obtained in the azimuth plane. However, 
as with 99.5 GHz, there was a noticeable discrepancy between measured 
and calculated sidelobe levels in the elevation plane. The first sidelobe 
is —-25 dB compared with —16 dB at 99.5 GHz. 

The measured gain at 28.5 GHz was obtained by padding the 7-m re- 
flector with a calibrated attenuator (39.93 + 0.05) dB, and comparing 
with a calibrated gain standard (24.98 + 0.08) dB.!8 Using 15 different 
locations for the gain standard, the measured gain was determined to 
be (64.7 + 0.6) dB. The large 3c error limit was the consequence of the 
data spread and is consistent with the results of the probing measure- 
ments. The expected prime focus gain at 28.5 GHz can be calculated as 
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Fig. 16a—Measured 28.5-GHz azimuthal pattern with prime focus feed of 20-dB taper 
showing excellent agreement with calculated pattern for perfect surface. 


follows: 66.42 dB = Area Gain 
—1.56 dB = Illumination Taper 
—0.08 dB = Spillover 
—0.1 dB = Feed Loss (Estimated) 
64.68 dB = Calculated Gain for Perfect Surface 


4.4 Subreflector alignment and measured results 


The hyperboloidal subreflector is required to be confocal and coaxial 
with the paraboloidal main reflector. The primary focal point has been 
given by pattern measurements using prime focus feeds. Before instal- 
lation of the subreflector, a laser beam was first fixed along the reflector 
axis. The subreflector was oriented with the aid of the laser beam re- 
flected from a mirror attached to the bottom of the subreflector, centered 
on and perpendicular to its axis. The position of the subreflector was 
adjusted by interpreting the measured 99.5-GHz patterns until they were 
consistent with the prime-focus-fed patterns. 

The Cassegrainian feed, used in the 99.05-GHz pattern measurements 
of the complete 7-m antenna including subreflector, consists of an offset 
ellipsoid and a dual-mode horn. The feed is essentially a scaled model 
of the 19-GHz offset launcher in the 19/28.5-GHz duo-polarization feed. 
Figure 17 shows that the measured 99.5-GHz feed patterns are almost 
the same as those of the 19/28.5-GHz feeds. Thus we can make direct 
comparison of measurements on the 7-m antenna at 99.5 GHz with the 
19- and 28.5-GHz performance. 
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Fig. 16b—28.5-GHz elevation pattern with prime-focus feed of 20-dB taper. The mea- 
sured first sidelobe level is -25 dB compared with —16 dB at 99.5 GHz in Fig. 15. 
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Fig. 17—Measured patterns of 99.5-GHz Cassegrainian feed consisting of an offset el- 
lipsoid and a dual-mode horn. 
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Fig. 18—99.5-GHz scan over a 2-degree range measured with Cassegrainian feed of 14-dB 
taper and compared with the calculated pattern envelope for perfect surface. The above 
patterns are consistent with those of prime-focus feed in Fig. 15. 


Measured patterns are found to be insensitive to the location of the 
feed phase center as expected from the very long effective focal length 
of the antenna. With the feed phase center located on axis, the beam 
pointing of the Cassegrainian configuration agrees with that of the prime 
focus configuration to within 0.02 degree. Measured 99.5-GHz patterns 
are shown in Fig. 18 together with the calculated pattern envelope!® for 
comparison. Measured half-power beamwidths agree with calculated 
values, whereas measured sidelobe levels are higher than calculated levels 
by about the same amount as in the prime focus measurements. Since 
the Cassegrainian feed has an illumination taper of 14 dB in contrast with 
the 20-dB taper of the prime focus feed, the sidelobe level is expected 
to be higher than that of the prime focus configuration. The measured 
first sidelobe level in the elevation plane is almost the same for prime 
focus and Cassegrainian configurations, because it is dominated by re- 
flector distortion rather than illumination taper. It is seen that, as with 
the prime-focus case, the discrepancy between azimuth and elevation 
patterns is confined to the near sidelobe region, whereas the far sidelobe 
levels of two patterns merge together. 

Pattern measurements were also taken for each of the four feeds in 
the duo-polarization 19/28.5-GHz quasi-optical feed assembly. The 7-m 
antenna was first tested with the 19/28.5-GHz feeds located at the on- 
axis position using both an oversized mirror of 38.10 by 53.34 cm oval 
shape and a smaller mirror of 30.48 by 43.18 cm. Measurements in each 
case showed the coincidence of four beam maxima for each polarization 
and frequency of the quasi-optical feed network. The smaller mirror 
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Fig. 19a—Measured 19-GHz azimuthal pattern with Cassegrainian feed of 15-dB taper 
and vertical polarization showing excellent agreement with the calculated pattern for 
perfect surface. 


showed very little truncation effect in measured patterns. Following 
these on-axis measurements, the feed box for the beacon receiver to- 
gether with the smaller mirror was moved to a position 0.5 degree off axis 
from the center of the vertex equipment room to allow clearance for an 
on-axis beam for millimeter-wave radio astronomy as shown in Fig. 9. 
As expected,*}4 measurements showed very little difference between the 
0.5-degree off-axis and on-axis beams for both 19 and 28.5 GHz. 

Measured cross-polarized radiation for each on-axis beam remained 
below —40 dB in all directions with respect to the in-line polarization 
maximum, while that for each 0.5-degree off-axis beam is smaller than 
—39 dB. One notes that the optimum orientations of the polarization-grid 
diplexer for nulling the cross polarization are about 0.7 degree apart 
between two orthogonally polarized feeds. The above cross-polarization 
data were measured using a compromise orientation of the polarization 
grid. 

Figures 19 and 20 show comparisons between measured and calculated 
patterns! at 19 and 28.5 GHz. The measured patterns were obtained 
from vertically polarized feeds at on-axis position with the smaller 45- 
degree mirror, and remained essentially the same for other combinations 
of polarization and mirror at both on-axis and 0.5-degree off-axis feed 
positions. Good match between calculated and measured azimuthal 
patterns is illustrated in Figs. 19(a) and 20(a), whereas the agreement 
between calculated and measured elevation patterns is, again, less sat- 
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Fig. 19>—Measured 19-GHz elevation pattern with Cassegrainian feed of 15-dB taper 


ane vertical polarization showing fair agreement with the calculated pattern for perfect 
surface. 


isfactory as shown in Figs. 19b and 20b. The measured sidelobes of 
all elevation patterns have generally shown, especially in Fig. 20b, a 
period twice that of the calculated value. This observation supports the 
conjecture about a relative misalignment of two sections of the template 
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Fig. 20a—28.5-GHz azimuth pattern with Cassegrainian feed of 15 dB taper and vertical 
polarization. The measured pattern is consistent with that in Fig. 16a for prime focus rece 
of 20-dB taper. 
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_ Fig. 20b—28.5-GHz elevation pattern with Cassegrainian feed of 15-dB taper and ver- 
tical polarization. The measured pattern is consistent with that in Fig. 16b for prime focus 
feed of 20-dB taper. 


as discussed in Section 4.3. Measured 19-GHz sidelobe levels 1 degree 
away from the beam maximum are —43 dB in the azimuth plane and —38 
dB in the elevation plane. These levels are of interest in avoiding inter- 
ference from future adjacent satellites in synchronous orbits. 


V. CONCLUDING REMARKS 


The expected performance of the Crawford Hill 7-m antenna and its 
associated feed systems has been realized. This antenna is the first large 
offset Cassegrain in operation. Comparison between prime-focus-fed 
and Cassegrain-fed patterns shows very little degradation due to any 
surface imperfection or misalignment of the subreflector. 

Comparison with a calibrated gain standard showed the difference 
between the 99.5-GHz measured and calculated prime-focus gains to 
be (0.79 + 0.45) dB, which implies an rms surface error of about 0.1 mm. 
The measured 99.5-GHz azimuthal pattern appears to indicate a surface 
error of this magnitude, whereas the elevation pattern shows skewed 
shape and unexpectedly high near-in sidelobe level. Computer simula- 
tions have indicated that the distorted elevation pattern can be caused 
by a relative displacement (~0.15 mm) between two sections of the 
template used to calibrate the reflector panels. This explanation is also 
consistent with the measured gain because it is only accompanied by a 
small gain reduction (~0.15 dB). . 

Pattern measurements using the quasi-optical 19/28.5 GHz dual- 
polarization feed assembly have shown the coincidence of the four beams. 
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Table | — Derived Cassegrainian gain in decibels with 15-dB feed 


taper 

Frequency (GHz) 99.5 28.5 19 
Area Gain 77.26 66.4 62.88 
Illumination Taper —0.93 —0.93 —0.93 
Spillover* —0.35 —0.4 —0.45 
Feed and Multiplexingt —0.2 —0.2 —0.2 
Surface Tolerance —0.8 —-0.1 —0.05 
Derived Gain 74.98 64.77 61.25 
Error Estimates +0.5 +£0.25 +£0.25 
Gain Efficiency 59.2% 68.7% 68.7% 


* The estimates for spillover loss are higher at 19 and 28.5 GHz because of truncations 
in the quasi-optical 19/28.5 GHz duo-polarization feed. 

* No multiplexing is involved at 99.5 GHz, whereas both frequency and polarization 
diplexing losses are included in the estimates at 19 and 28.5 GHz. 


Cross-polarized radiation is —40 dB or less in all directions throughout 
each beam. The measured results have now confirmed the theoretical 
prediction? that there should be very little cross-polarized radiation from 
an offset Cassegrainian antenna with a large effective F/D ratio if the 
feed radiation is free of cross polarization. 

Good agreement between calculated and measured patterns is shown 
in the azimuthal plane, whereas the comparison is less satisfactory in 
the elevation plane. At 1 degree away from the beam maximum, the 
19-GHz sidelobe level is —43 dB in the azimuth plane and —38 dB in the 
elevation plane. Since the synchronous orbit is seldom close to the ele- 
vation plane of a ground station antenna, these measured sidelobe levels 
have practically achieved the objective of 40-dB discrimination between 
adjacent synchronous satellites at 1-degree spacing. 

The gain measurement of the Cassegrainian configuration is hampered 
by the sensitivity of the harmonic mixer to the temperature difference 
between indoor and outdoor environments and by a lot of cable move- 
ment in addition to the terrain reflection problem at 19 and 28.5 GHz. 
However, having determined the main reflector surface tolerance by 
prime focus measurements, we can derive the Cassegrainian gains as 
shown in Table I. 

Multiple-beam operation has been achieved with a 0.5-degree off-axis 
beam for beacon feed and an on-axis beam for millimeter-wave radio 
astronomy. Measurements showed very little difference between 0.5- 
degree off-axis and on-axis beams at both 19 and 28.5 GHz. 
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This paper describes the receiving electronics built at the Bell Lab- - 
oratories Crawford Hill facility at Holmdel, New Jersey to use the 19- 
and 28-GHz beacons on the COMSTAR satellites for propagation mea- 
surements. The receiving system accurately determines attenuation, 
differential phase, depolarization, bandwidth limitations and angular 
scatter of these signals produced by rain. This highly reliable system 
operates continuously and unattended; it automatically reacquires the 
beacon signals after dropout due to severe attenuation or momentary 
power outage. Correlations among strong and weak signal components 
are used to permit detection of weak cross-polarized signals during 
severe fading. Receiver noise bandwidths as low as 1.6 Hz are used. A 
high degree of phase stability is achieved in all circuits and compo- 
nents. 


I. INTRODUCTION 


The receiving electronics for the COMSTAR beacons has placed strong 
demands on technology in several areas in order to meet the require- 
ments of the propagation experiments.!2:3 The receiving system built 
at the Bell Laboratories Crawford Hill facility includes a precision an- 
tenna‘ and the receiving electronics necessary to make maximum use 
of the 19- and 28-GHz beacon signals radiated by the COMSTAR satel- 
lites. The receiving electronics is the subject of this paper. 

Continuous unattended operation is required so that all significant 
weather events are included in the resulting data base; thus, a very high 
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degree of reliability in the receiving electronics is required, and automatic 
reacquisition of the beacon signal after dropout due to severe attenuation 
or momentary power outage is essential. Since relative phases of the 
many signal components must be precisely measured, the phase stability 
of all circuits and components demanded careful attention. Also, circuit 
arrangements had to be devised to ensure that signals which were later 
to be compared in phase traversed a common path through high-gain 
amplifiers and other phase-sensitive equipment. 

In order to obtain the maximum possible measuring range using the 
modest powers radiated by the satellite beacons, very narrow receiver 
noise bandwidths are required. This puts a premium on the stability of 
the source oscillators in the satellites and the local oscillators in the earth 
stations. The receiver includes an AFC circuit with built-in memory to 
facilitate reacquisition after loss of signal. The feature also permits easy - 
return to propagation measurements after use of the antenna system 
for radio astronomy studies during clear weather periods. Maximum use 
was made of known correlations among strong and weak signal compo- 
nents to permit detection of weak cross-polarized signals during severe 
fading. The following is an account of how these objectives were achieved, 
together with a description of the resulting apparatus and its operating 
characteristics. General design considerations are in Section II; Sections 
III and IV cover 19-GHz and 28-GHz receiver channels, respectively. 
Local oscillators and frequency control techniques are discussed in 
Section V. Section VI covers polarization switch synchronization. Data . 
collection equipment is described in Section VII; Section VIII covers the 
receiver calibration source. Receiver performance and some sample data 
are included in Section IX. 


ll. RECEIVER DESIGN CONSIDERATIONS 


The multichannel satellite beacon receiver must measure and record 
the signal amplitudes and phases listed in Table I for all rain events in 
order to satisfy the needs of the propagation experiment. Because of 
beacon! and rain® characteristics the receiver must: (i) have narrow (1.6 
to 24 Hz) final noise bandwidths, (iz) keep the receiver frequencies within 
the narrow IF filter bandwidths (BW) as the beacon frequencies vary, (iii) 
hold local oscillator (LO) frequencies within a few Hz of their last known 
frequency for several minutes when rain attenuates the beacon signals 
below the frequency tracking threshold or when the primary power drops 
out, (iv) discriminate against the 1-kHz polarization switching sidebands 
while reacquiring the beacon signals automatically after long periods 
of loss of signal, (v) synchronize receiver polarization switches with the 
beacon polarization switch, and (vi) perform these functions reliably and 
automatically to permit continuous unattended operation. During clear 
air conditions the earth-station antenna‘ is used for radio astronomy 
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Table | — Signal amplitudes and phases 


Receiver output Description 
Amplitude, 19 GHz 
A19H10 Horizontal copolarized,* 10-Hz IF BW 
A19V10 Vertical copolarized,t 10-Hz IF BW 
A19XH10 Horizontal crosspolarized,! 10-Hz IF BW 
A19XH1 Horizontal crosspolarized,! 1-Hz IF BW 
A19XV10 Vertical crosspolarized,tt 10-Hz IF BW 
A19XV1 Vertical crosspolarized,tt 1-Hz IF BW 
A190A1 Off-axis receiving beam, 1-Hz IF BW 
Phase difference, 19 GHz 
o19V-H (Vert. copol.t)-(horiz. copol.*), 10-Hz IF BW 
o19V-XV (Vert. copol.t)-(vert. crosspol.*t), 10-Hz IF BW 
~19V-XH (Vert. copol.t)-(horiz. crosspol.?), 10-Hz IF BW 
Amplitude, 28 GHz 
A28V15C Vertical copolarized,*t 15-Hz IF BW, carrier 
A28V1.5C Vertical copolarized,t 1.5-Hz IF BW, carrier 
A28V15U Vertical copolarized,t 15-Hz IF BW, upper sideband 
A28V15L Vertical copolarized,t 15-Hz IF BW, lower sideband 


A28XV15C Vertical crosspolarized,tt 15-Hz IF BW, carrier 
A28XV1.5C Vertical crosspolarized,tt 1.5-Hz IF BW, carrier 
A280A1.5 Off-axis receiving beam, 1.5-Hz IF BW 
Phase difference, 28 GHz 
28VC-XVC (Vert. copol.t carrier)-(vert. crosspol.tt carrier), 15-Hz IF BW 
o28VC-U (Vert. copol.t carrier)-(upper sideband), 15-Hz IF BW 
o28VC-L (Vert. copol.t carrier)-(lower sideband), 15-Hz IF BW 
Phase difference, crossband 
$19V-28VC (19-GHz vert. copol.,* 10-Hz IF BW) 
—(28-GHz vert. copol. carrier, 15-Hz IF BW) 


* Horizontal copolarized is transmit horizontal and receive horizontal. 
t Vertical copolarized is transmit vertical and receive vertical. 

t Horizontal crosspolarized is transmit horizontal and receive vertical. 
tt Vertical crosspolarized is transmit vertical and receive horizontal. 


and other studies so the receiver must be easily restarted by people not 
intimately familiar with it. 

Signal amplitudes and phases must remain accurate over receiver 
temperature changes of +15°C and for signal attenuations of >30 dB. 
For example, amplitude and phase differences between the two copo- 
larized 19-GHz channels must remain within 0.5 dB and 2°.6 This re- 
quires careful attention to differential temperature control and to lin- 
earity. 

Minimizing the number of frequency conversions is desirable. to 
minimize receiver complexity and the number of spurious mixing 
products. 

Since the 19- and 28-GHz beacon signals are derived from a common 
oscillator, they have the same frequency fluctuations. Thus, extended 
measuring range can be provided in the 28-GHz channels and in low 
signal 19-GHz channels (off axis and cross-polarization) if: (7) the cor-: 
responding 28-GHz and 19-GHz receiver LO sources are common, (ii) 
frequency fluctuations in the beacon and in LOs are tracked out witha 
common oscillator in a loop locked in frequency or phase to the 19-GHz 
vertically polarized signal, the signal that will experience the least at- 
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tenuation,° and (iii) very narrow band (~1 Hz BW) IF filters are used in 
the extended range channels. The provisions necessary for extending 
measuring range are easier to implement if all 19- and 28-GHz receiver 
IFs and LOs are in a 2:3 ratio. 

The antenna has a small equipment room, the vertex cab, near the 
vertex of the main reflector, above both the azimuth and elevation axes; 
another room, the side cab, is to one side of the elevation axis but above 
the aximuth axis. The control building is about 15 meters (50 feet) away 
from the antenna at ground level (see Ref. 1, Fig. 3). The receiver is 
distributed among the three equipment locations to optimize noise 
performance and phase and amplitude stability, taking into account 
space limitations in the various equipment rooms. 

The receiver is packaged by functional groups, e.g., bandpass filters, 
mixers and IF amplifiers for all the second frequency conversions are 
packaged together. This packaging approach maximizes differential 
phase and amplitude stability by minimizing temperature differential 
between similar components in different channels and also allows the 
receiver to be built in many separable blocks that may be “debugged” 
individually without complex interaction. 

Since power line transients and momentary power outages are ex- 
pected during heavy rain, all oscillators, filter stabilizing ovens and 
frequency memory registers are powered by batteries charged contin- 
uously from the power line. 


lil. 19-GHz RECEIVER CHANNELS 


The 19-GHz receiver channels from the antenna feeds through the 
second IF crystal filters are shown in Fig. 1. The 19-GHz beacon signals 
are received with vertically and horizontally polarized feeds whose re- 
sulting antenna beams are pointed at the satellite. Another 19-GHz feed 
is located so its antenna beam is pointed toward an unoccupied syn- 
chronous orbit location about 0.74° off-axis from the satellite. This 
off-axis beam detects signals scattered by rain from the satellite beacon 
path into other potentially useful paths. This scattered signal is another 
possible source of cochannel crosstalk in multisatellite systems.”8 

Test signals from a calibration source described in Section VIII are 
fed into directional couplers between the antenna feeds and the first 
mixers. These calibrated test signals have adjustable amplitudes, are 
polarization switched and have adjustable simulated “crosspolarization” 
levels. The short term frequency stability of the calibration source is 
representative of the stabilities of the satellite beacons. 

Signals are mixed with the 18.037-GHz first LO in the Schottky-diode 
balanced first mixers. Parametric amplifiers are not used because (7) they 
would contribute excessive amplitude and phase instability and (ii) 
adequate measuring range is more economically and easily provided by 
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narrow-band IF filtering. These mixers and associated low-noise 
broadband IF preamplifiers (~25 dB gain) are mounted on the feed 
horns. The first mixer and IF preamplifier single sideband noise figure 
(NF) of approximately 6.5 dB essentially determines the overall receiver 
NF. The remainder of the receiver contributes <0.1 dB. The first LO is 
distributed from a common source to all 19-GHz channels to preserve 
phase in the mixing process and to insure that the frequency fluctuations 
are correlated among the channels; isolating power dividers and ferrite 
isolators (see Fig. 5) insure >70 dB isolation between receiver channels 
through this common LO path. The 1.003 GHz first IF is dictated by the 
1.056 GHz required bandwidth for the modulation sidebands in the 28 
GHz receiver, by the need to keep 19- and 28-GHz IFs and LOs in a 2:3 
ratio, and by the need to derive corresponding LOs from common 
sources. 

After 30 dB more IF amplification the vertically and horizontally 
polarized channels are transfer-switched in synchronism with the beacon 
polarization switch. In this switch the copolarized signals (V and H) in 
adjacent time slots are switched into one receiver channel and the 
cross-polarized signals (XV and XH) are switched into the other channel. 
The accuracy of the differential amplitude and phase between the co- 
polarized signals is very critical in the calculation of depolarization for 
other polarization angles.® The transfer switching insures that the phase 
and amplitude variations in most of the filters, amplifiers and long cable 
runs in the copolarized signal channel will affect the V and H signals 
identically. Thus, these variations will not affect the differential am- 
plitude and phase measurements. 

As indicated in Figure 1, the first mixers, first IF amplifiers and 
transfer switch are on the antenna feed assembly in the vertex cab. The 
mechanical rigidity of the feed assembly is adequate to ensure <0.5° 
differential phase variation (measurement limits) due to differential 
mechanical motion of feed components. Thermal differential phase 
variation for the feed assembly and associated receiver components is 
<0.2° per °C. 

A separate 19-GHz receiver channel, used for signal acquisition and 
polarization switch synchronization, branches off before the transfer 
switch. 

Cables run from the vertex cab to the side cab after the transfer switch 
but before 0.8-GHz BW bandpass filters (BPFs); these filters constrain 
the noise bandwidth to prevent noise from saturating the following 
broadband IF amplifiers and are mounted on an aluminum plate to 
minimize the temperature differential between them. 

The second mixers reject image noise by more than 19 dB by phase- 
cancelling techniques (single sideband mixing) to permit the large 
fractional frequency spread between the first IF (1.003 GHz) and the 
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second IF (2.067 MHz). The 1.001 GHz second LO also is distributed from 
a common source to all 19-GHz channels. Isolating power dividers and 
ferrite isolators (see Fig. 6) insure >65 dB isolation between receiver 
channels through this common LO path. The second LO automatic fre- 
quency control (AFC) that follows average long-term frequency changes 
is described in Section V. The 5-MHz Bw of IF amplifiers following the 
second mixers is narrow enough to prevent amplifier saturation on noise 
but wide enough both to preserve the rise time of the polarization 
switched signals and to render filter phase stability of little concern. The 
second IF amplifiers drive 50 meters of cable from the side cab to the 
control building. 

Step attenuators match signal levels between the constant-gain low 
level (signals S < noise N) front part of the receiver and the constant- 
gain high level (S > N) back part. This permits maximizing dynamic 
range by setting the clear air signal outputs near maximum. The second 
IF polarization switches separate the time-sequenced signals (V, H, XV, 
and XH) into separate channels. After further amplification, quartz 
crystal BPFs do the following: (i) further restrict the noise bandwidth, 
(ii) reject image frequency noise >20 dB for the third frequency con- 
version, (iii) significantly increase the rise time of the signal pulses and 
(iv) reduce second IF switch transients at the third conversion image 
frequency. The 6-kHz Bw of these filters is as narrow as possible con- 
sistent with differential phase stability requirements. The four filters 
for the main beam channels (V, H, XV, and XH) are mounted together 
in asolid aluminum enclosure to minimize temperature differences. 

The 2.067-MHz second IF frequency is a compromise restricted on the 
low side by LO noise close to the second LO frequency, by the need to 
preserve the rise time of the polarization-switched signals, and by the 
level of switching transients near the second IF frequency. The choice 
is restricted on the high side by phase stability considerations for the 
crystal BPFs and by the desire to minimize the number of receiver fre- 
quency conversions. 

The acquisition and polarization switch synchronization channel 
preserves the 1-kHz polarization-switching signal from the beacon, 
uncontaminated by receiver switching. This channel is similar to the 
other receiver channels down to the crystal BPFs, except for the omission 
of switches. The acquisition and polarization switch synchronization 
circuits described in Sections V and VI contain their own automatic gain 
controlled IF amplifiers and crystal BPFs. 

The off-axis rain scatter channel is essentially identical to the main 
beam channels except for the omission of polarization switches. 

The 19-GHz receiver channels, continuing from the second IF crystal 
BPFs through the amplitude and phase detectors, are shown in Fig. 2. 
As indicated earlier, image rejection for the third frequency conversion 
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is done in the 6-kHz crystal BPFs. These stable high-Q filters permit a 
large fractional frequency spread between the second IF (2.067 MHz) 
and third IF (6.25 kHz). The 6.25 kHz third IF is constrained by phase 
stability considerations in the crystal BPFs and the need for at least 20 
GB of filter rejection at the image frequency. The third mixers are double 
balanced (ring diode) with >30 dB signal to LO and LO to signal isolation. 
This isolation, along with the isolation in the LO power dividers (see Fig. 
8), produces >80 dB isolation (measurement limit) between receiver 
channels through the common third LO path. 


Because the third Lo is phase-locked to the V copolarized signal 
channel using the phase locked loop (PLL) described in Section V, the 
short-term instabilities of the beacon oscillator and all receiver LOs are 
removed from all receiver channels at the third mixers. Thus, all third 
and fourth IF signals are as stable as the 325-Hz PLL reference as long 
as the PLL is locked. 


Single resonator (single pole pair) active BPFs with 250 Hz Bw follow 
6.25 kHz IF amplifiers. These filters further restrict the noise BW, further 
reduce the polarization switching sidebands and receiver switching 
transients, and reject image noise and switching transients for the fol- 
lowing fourth frequency conversion. The high input impedance of active 
analog multipliers used for the fourth mixers combined with the low 
output impedance of the operational amplifier supplying the 5.925-kHz 
fixed-frequency fourth LO (see Fig. 8) result in >80 dB isolation between 
receiver channels through the common fourth LO path. 

Single resonator, 10-Hz 3-dB BW, active BPFs (~16 Hz equivalent 
noise BW) follow the fourth mixers. These BPFs are the final IF filters 
for the copolarized signal channels and for phase and amplitude mea- 
surement to moderate attenuation levels in the crosspolarized signal 
channels. 

The specific third and fourth IFs are chosen to minimize the levels of 
the mixing products from the 1-kHz switching sidebands, both on the 
signal and the image sides of the third and fourth Los. More than 60 dB 
filtering of the 1 kHz switching sidebands by the cascaded third and 
fourth IF BPFs insures that the phase ripple produced by the sidebands 
cannot exceed +0.1°. 


All of the third IF and 10-Hz fourth IF active BPFs and the fourth 
mixers are constructed with low-temperature-coefficient components. 
They also are enclosed in a temperature stabilized oven with <14°C in- 
ternal temperature variation to maintain phase stability. The third and 
fourth IF filter Qs are nearly the same so that, assuming the same com- 
ponent variations, their contributions to the overall phase stability would 
be nearly equal. These filters were aged at oven temperature (65°C) 
before final alignment. The fourth IF BPF outputs are buffered through 
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Fig. 83—28-GHz receiver channels: antenna feeds through second IF. 
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low-pass high-pass operational amplifiers that suppress low frequency 
(1/f) noise and broadband noise in the active BPF outputs. 

Linear amplitude detectors following the fourth IF filters are full wave 
rectifiers comprising diodes in feedback paths of wideband low-drift 
operational amplifiers. These detectors are linear in dc output vs. signal 
input (voltage) to within +0.1 dB over >60 dB signal range and a +15°C 
temperature range. The phase detectors are commercial units with 
limiters (<+1° over 60 dB) in each channel followed by phase-to- 
pulse-width circuits and low-pass filter integrators. They measure phase 
unambiguously over 360°. 

Because the PLL in the third LO removes short-term oscillator fluc- 
tuations from the fourth IF signal and because the XV, XH and rain 
scatter signal levels should always be less than the V copolarized signal 
level, the amplitude measuring range of the XV, XH, and rain scatter 
channels is extended by active BPFs with 1-Hz 3-dB Bw. The bandwidth 
of these filters, which are similar to the 10-Hz Bw filters and also are 
enclosed in an oven, is limited to about 1 Hz by the expected maximum 
rates of change of signal parameters.” 

Again, the off-axis rain scatter channel is packaged separately and is 
essentially identical to the main beam crosspolarized signal channels 
except for the omission of the 10-Hz Bw fourth IF BPF and the phase 
detector. 

The outputs from the amplitude and phase detectors feed the de 
signal-conditioning amplifiers in the data collection equipment in 
Fig. 16. 


IV. 28-GHz RECEIVER CHANNELS 


The 28-GHz receiver channels from the antenna feeds through the 
second IF crystal BPFs are shown in Fig. 3. The 28-GHz copolarized 
beacon signal (V) is received with a vertically polarized feed whose re- 
sulting antenna beam is coaxial with the 19-GHz beams. A horizontally 
polarized feed, whose beam is coaxial with the vertically polarized beam, 
receives the crosspolarized signal component (XV). These 28-GHz feeds 
share the main beam feed frame‘ with the 19-GHz main beam feeds. A 
28-GHz off-axis rain scatter feed produces a beam coaxial with the 
19-GHz off-axis beam. 

Since many of the components and functions in the 28-GHz channels 
are similar to the corresponding ones in the 19-GHz channels, descrip- 
tions of similar components and functions are not repeated here. The 
major differences between the 19- and 28-GHz channels result because 
there is no polarization switching at 28 GHz but there are sidebands 
coherent with the 28-GHz carrier.! These sidebands (+264 MHz for two 
satellites and +528 MHz for another) are used for measuring amplitude 
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and delay dispersion.! Component locations discussed under 19-GHz 
receiver channels are indicated on Fig. 3. 

As discussed earlier, 19- and 28-GHz receiver IFs and LOs are in a 2:3 
ratio and corresponding LOs are derived from the same sources for ex- 
tending measurement range and to permit measuring phase between 19- 
and 28-GHz carriers. This phase measurement yields the spectrum of 
east-west satellite pointing angle fluctuations because of the interfer- 
ometer formed by the 2 meter (6.6 ft.) east-west separation between the 
19- and 28-GHz satellite antennas. The measurement also yields dis- 
persion information for a large frequency separation (9.5 GHz). In order 
to minimize effects of frequency changes on this phase measurement, 
all BPFs with significant delay (e.g., the crystal BPF) also have a 2:3 center 
frequency and bandwidth ratio between 19- and 28-GHz channels. 

The calibration source provides calibrated test signals for all of the 
28-GHz channels; sidebands coherent with the carrier and with either 
+264 MHz or +528 MHz separation can be selected. 

A single fixed-frequency LO converts the entire 28.56 + 0.528 GHz 
band, encompassing both sets of sidebands and the carrier, to the first 
IF centered at 1.5 GHz. This ensures that the short-term frequency 
stability of the most critical first LO is not compromised by other re- 
quirements. The first mixer and IF preamplifiers, which are similar to 
the 19-GHz units, have noise figures (SSB) of <7 dB over the entire 1.5 
+ 0.528 GHz band. The first IF is a compromise that is pushed toward 
low frequencies by IF preamplifier noise-figure considerations and 
toward high frequencies both by the need to pass the wide bandwidth 
and by first LO noise considerations. Since many standard octave 
bandwidth components are available for the 1- to 2-GHz band, the 28- 
GHz receiver first IF band is set at 1.5 + 0.528 GHz. This constraint, 
along with the 2:3 ratio IF and LO constraint, fixes the first IF of the 
19-GHz receiver at 1 GHz. The specific IF frequency, 1.504 GHz, results 
from maximizing the frequency separation between the desired IF signals 
and spurious mixing products from this and later frequency conversions. 
Isolation between receiver channels through the common first LO path 
(see Fig. 5) is >65 dB. Sidebands (SBs) are split into separate channels 
in the first IF so the bandwidth of the remainder of the receiver can be 
narrowed to increase sensitivity. (With the existing radio link parameters 
the maximum S/N in a 1-GHz Bw is <—20 dB.) The upper sideband 
(USB) channel uses the same 0.4 GHz BW BPF, broadband first IF am- 
plifiers, second mixer, second IF amplifiers and crystal BPFs for both the 
+264 and +528 MHz usss. The corresponding parts of the LSB channel 
also are the same for both sideband frequencies. The same narrowband 
crystal filters are usable because the third LO SB frequency corrections, 
discussed in the next paragraph and in Section V, are done in the second 
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LOs. This second LO correction keeps the frequency spread between the 
different SBs small (<+200 Hz) at the second IF. 

The measurement of delay dispersion! requires measuring phase be- 
tween the carrier and the USB scaled by 108/109 or 108/110 and between 
the carrier and the LSB scaled by 108/107 or 108/106. These scaling 
factors are the exact ratios between the carrier and the SBs for the dif- 
ferent satellites. Preserving the carrier and SBs in these ratios throughout 
the receiver requires separate LOs for carrier and SBs scaled similarly 
in frequency. Simple mixing by a single LO, as is done in the first fre- 
quency conversion, does not preserve the ratios. Therefore, sideband 
LO frequency corrections must be made elsewhere for this conversion 
and any other such conversion. The SB frequency corrections for the first, 
second, and third LOs are made to the SB second LO frequencies. This 
is done by adding or subtracting the “missing” portions of these LOs [(1 
or 2)/108 X fio] to the second LO used for carrier mixing. The LO im- 
plementations in Fig. 6 for making the corrections are described later. 
Provisions for remotely switching the SB channel LOs in the side cab from 
+264 to +528 MHz for the different satellites are provided in the control 
room. Isolation among the V carrier, XV and off-axis channel through 
their common LO path is >61 dB. 

The 28-GHz off-axis rain scatter channel is identical to the 28-GHz 
XV channel and packaged together with the 19-GHz off-axis channel. 

The 28-GHz receiver channels, continuing from the crystal BPFs 
through the amplitude and phase detectors, are illustrated in Fig. 4. Since 
the SB LO frequency correction for the third LO is included in the second 
LO, the third LO is common to all 28-GHz channels. Isolation among 
28-GHz channels through the common third LO path (see Fig. 8) is 
provided in the same way as in the 19-GHz channels and is >80 dB 
(measurement limit). 

In the USB channel the same broadband third mixer and third IF 
amplifier are used for both USB frequencies. The same is true for the LSB 
channel. 

Although not shown separately in Fig. 4, the active third IF and fourth 
IF BPFs and fourth mixers are separate for the USB channels for the 264 
and 528 MHz sBs. These components are separate also for the corre- 
sponding LSB channels. The fourth Los for the four SB channels include 
the fourth LO frequency corrections (see Fig. 10). Separate channel filters 
are used at this point in the receiver because the different SB frequencies 
are spread approximately 1 percent in frequency (1/107 and 1/109) and 
the BPF 3-dB Bws are only 3 percent. 

The 1.5-Hz BW BPFs in the XV and V carrier channels provide mea- 
suring range extension as described for 19-GHz receiver channels. 

The active filters and fourth mixers for the XV and V carrier channels 
and for the +264 MHz USB and LSB channels are in one oven. The +528 
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Fig. 5—Receiver first local oscillator (crystal oscillator frequency is 5.218991 MHz). 


MHz USB and LSB channels and the 1.5-Hz BW active BPFs are in an- 
other oven. 

In the SB frequency scalers, the SB frequencies are scaled by 107/108 
and 109/108 or by 106/108 and 110/108 for the delay dispersion phase 
measurement. The sideband signals are limited and scaled by PLL fre- 
quency multipliers and digital dividers. 

The crossband phase measurement is made between the 28.56- and 
19.04-GHz V-copolarized carriers. The 325-Hz fourth IF signal from the 
19-GHz receiver channel (Fig. 2) is divided by 2 and the 487.5 Hz fourth 
IF signal from the 28-GHz receiver channel (Fig. 4) is divided by 3 before 
the crossband phase detector (Fig. 4). 

The off-axis rain scatter channel is similar to the XV channel. Both 
the 19- and 28-GHz off-axis channels are packaged together and are 
separate from the mainbeam channels. 


V. LOCAL OSCILLATORS AND FREQUENCY CONTROL TECHNIQUES 


Frequency requirements for the LOs and the receiver channel-to- 
channel isolation through the common LO paths are discussed in the 
previous sections. 


5.1 First LO 


The first LO is shown in Fig. 5. A 5.218991-MHz high stability (1 x 
10-10/day, 1 X 10—12/0.1 sec) quartz crystal oscillator is multiplied first 
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by 48 to 250.5 MHz for the second LO sideband correction frequency 
described earlier and then by 36 to 9.019 GHz; further multiplication 
by 2 or by 3 produces the coherent LO signals for the 19- and 28-GHz 
receiver channels. Passive varactor frequency multipliers follow the 
power amplifier at 501 MHz. The first LO does not include frequency 
control so the stability of the basic crystal oscillator is preserved. The 
first LO frequency multipliers from 62.6 MHz to 18 and 27 GHz, power 
dividers, isolators, and waveguide distribution are mounted on the an- 
tenna feed assembly in the vertex cab. 


5.2 Second LO 


The second LO is derived from a 5.000000-MHz quartz frequency 
standard as shown in Fig. 6. Multiplication by 3 produces a component 
of the sideband correction frequency at 15 MHz. This signal is multiplied 
by 36 to 540 MHz where the frequency of a 39-MHz frequency synthe- 
sizer is subtracted in a mixer. The synthesizer frequency is controlled 
by the automatic frequency control loop (AFC) indicated on Fig. 1 and 
described later in this section. Multiplication by 2 and by 3 after the 
mixer produces the coherent second LO signals for the 19- and 28-GHz 
receiver channels. 

The second LO does the first, second, and third Lo frequency scaling 
described earlier for the sideband channels of the 28-GHz receiver. This 
scaling starts by adding 15 MHz from the second LO frequency multiplier 
and 28.6 KHz from the third LO (i.e., 1.08 MHz + 36) in a mixer; these 
are both 1/108 of the contribution of their respective sources to the 
second and third LOs. The synthesizer output divided by 36 to 1.096 MHz 
is then subtracted from the correction term since the synthesizer fre- 
quency itself is subtracted in the 1.562-GHz LO multiplier chain; this 
1.096 MHz is also 1/108 of the synthesizer contribution to the 1.502-GHz 
LO. The resulting 13.933 MHz correction term, containing the contri- 
butions from the third LO and both components (oscillator and syn- 
thesizer) of the second LO, is then added to the 250.5 MHz = 27,056/108 
MHz correction term from the first LO. The resulting 264.4 MHz cor- 
rection term, (1/108) (fL01 + fLo2 + fLo3), is then added to the 1.502-GHz 
second LO frequency to produce the 1.766-GHz upper-sideband second 
LOU, described in the section on 28-GHz receiver channels. The cor- 
rection term also is subtracted from the second LO frequency to produce 
the corresponding second LOL at 1.237 GHz. 

The 264.4-MHz correction term is multiplied by 2 and similarly added 
and subtracted to provide second LOU and LOL for the beacons with 
+528.8-MHz sideband separation. 

The order of mixing of the correction frequencies makes the filtering 
- easiest. 
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5.3 AFC loop 


An AFC loop controlling the second LO is used to remove long-term 
frequency variations in both the satellite and the other receiver LOs. This 
AFC loop must meet the following requirements: 


(1) Track over +200 KHz. 
(ii) Track a +1.1 Hz/sec frequency ramp. 
(iii) Maintain +10-Hz accuracy while tracking. 
(tv) Have a frequency averaging time of around 1 second. 
(v) Hold frequency on command for 10 minutes to within +50 
Hz. 
(vi) Contribute negligible short-term frequency instability. 


Requirements (i) and (ii) are set by the expected long-term frequency 
behavior of the beacon.! Requirement (iii) guarantees that the phase- 
locked loop used for short-term frequency correction need not have a 
large capture range. Requirement (iv) constrains the AFC loop to average 
over most short-term frequency variations and track only long-term 
effects. This will improve the loop’s low-SNR tracking performance and 
allow a better estimate of the signal’s average frequency during outages. 
Requirement (v) states that the receiver frequency stability during signal 
outages of up to 10 minutes will be much better than the satellite’s; thus, 
the frequency region which must be searched during reacquisition is 
determined only by the satellite. 

These requirements impose severe stability and tunability require- 
ments on the loop frequency generation element. A digitally controlled 
frequency synthesizer provides frequency memory and long-term sta- 
bility equal to that of its reference oscillator and thus was chosen for this 
application. 

It is a simple “fact of life” that a fixed-frequency oscillator has better 
short-term frequency stability than a variable-frequency oscillator, be 
it a VCXO or synthesizer. To prevent the synthesizer from adding to re- 
ceiver short-term instability, a relatively low-frequency synthesizer is 
mixed with a high-frequency fixed source, as discussed previously. In 
addition, a synthesizer exhibiting low phase noise was chosen for this 
application. The exact synthesizer and first IF frequencies were chosen 
to insure that any internally-generated signal-frequency or image-fre- 
quency spurious signals would be well outside IF filter passbands. 

The circuitry for controlling the synthesizer output frequency is shown 
in Fig. 7. The 2.067-MHz IF from the 19-GHz vertically polarized feed 
is filtered to remove the 12.5-kHz image response created by the fol- 
lowing conversion. The filter output is levelled by an AGC amplifier to 
keep the discriminator input level constant. Since the clear-air S/N at 
the filter output is ~30 dB, the AGC will be noise-dominated only for 
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Fig. 7—Receiver second local oscillator AFC circuitry (digital portion). 


fades of 30 dB or more, and will hold the signal component of its output 
constant for fades of lesser depth. . 

To obtain the desired narrow discriminator bandwidth, the signal is 
downconverted with a fixed-frequency LO to 6.25 kHz and fed to two 
20-Hz-wide active bandpass filters tuned 20 Hz above and below center 
frequency. These filters are enclosed in a temperature-controlled oven 
to enhance their long-term frequency stability. The filter outputs are 
envelope-detected and the detector outputs summed and differenced. 
The difference is divided by the sum voltage in an analog divider. This 
division operation provides the effect of additional AGC and widens the 
capture range of the complete AFC loop by “propping up” the tails of the 
discriminator S curve. The 6.25-kHz signal, with 1-kHz polarization- 
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switching sidebands, is fed also to the acquisition detector described 
later. 

The discriminator output is filtered by an integrator which is reset 
once per second. The dead-zone comparator produces an output pulse 
on one or the other of two lines if the integrator output indicates an av- 
erage frequency error of more than 2 Hz. These pulses are used to in- 
crement or decrement a six-decade digital up-down counter; the BCD 
counter controls the synthesizer output frequency to 1-Hz resolution. 
Since the synthesizer frequency is doubled before use as a 19-GHz Lo, 
an average signal-frequency error of greater than 2 Hz over a 1-second 
interval will produce a 2-Hz local oscillator frequency correction during 
the next second. The loop will thus track a signal moving at up to 2 
Hz/second to +2-Hz accuracy. With six-digit frequency resolution, the 
loop tracking range is 2 MHz. 

The up-down counter may also be controlled from other sources. 
During the signal acquisition phase the counter is automatically stepped 
up or down to sweep the receiver around the expected signal frequency. 
The counter may also be stepped up or down manually. These operations 
are discussed later in this section. 

Long-term frequency memory is implemented by disabling the counter 
inputs when the signal fades below the threshold of the tracking loop. 
Power-failure protection is provided by powering the counter from an 
uninterruptible battery power source. In both cases the long-term 
“frequency memory” of the loop is solely determined by the frequency 
stability of the synthesizer master oscillator. 


5.4 Third LO and PLL 


The third LO is shown in Fig. 8. The 2.06-MHz voltage-controlled 
crystal oscillator (VCXO) source is controlled by the phase locked loop 
indicated on Fig. 2. The oscillator frequency is used directly for the 
19-GHz receiver channels and after division by 2 and multiplication by 
3 for the 28-GHz channels. The phase locked loop removes the remaining 
short-term oscillator frequency variations and presents a frequency- 
stable signal to the final 10-Hz and 1-Hz bandwidth receiver filters. This 
PLL must meet the following requirements: 

(it) ‘Track over +50 Hz. ; 
(it) Have a loop bandwidth selectable around 10 Hz. 

(tii) Hold at “center” frequency on command to within +2 Hz. 

(tv) Exhibit no frequency “walk-off,” and recover quickly from 

signal outages. 
Requirement (7) allows the loop to track the expected range of short-term 
frequency variations. Requirement (ii) permits the trading of loop 
threshold for amplitude stability in the final signal measurement filters 
(narrow bandwidth extends the loop threshold at the expense of am- 
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Fig. 8—Receiver third local oscillator in control room [voltage controlled crystal oscillator 
(VCXO) frequency is 2.0604 MHz]. 


plitude fluctuations due to FM-AM conversion in the measurement fil- 
ters). Requirements (iii) and (iv) guarantee good frequency memory 
during signal fades and recovery from them without long loop pull-in 
times. 

The loop components are shown in more detail in Fig. 9. The received 
-19-GHz V copolarized signal, at 325 Hz, is filtered by an active bandpass 
‘filter, limited, and phase compared with stable 325-Hz reference. The 
phase detector output is filtered by an active lowpass filter and used to 
control the third LO vCxo frequency. Thus, the 19-GHz V-copolarized 
signal is phase locked to the 325-Hz reference, and with it all other re- 
ceiver channels. 

The PLL noise bandwidth is adjustable over the range 5 to 50 Hz to 
encompass the range of short-term satellite oscillator stabilities expected. 
A loop damping factor ¢ of 0.8-1 was desired to avoid phase overshoot.? 
Both of these parameters are influenced by both the loop filter and the 
IF bandpass filter. A test using prototype RF and PLL hardware showed 
that an overall loop bandwidth B could be achieved with a loop filter 
natural frequency w, (rad/sec) = 1.2 B, aloop damping factor ¢ = 1.2, 
and an IF filter bandwidth = 1.5 B. To achieve the desired 5-50 Hz 
overall bandwidth range, the loop filter w, may be varied (through ad- 
justment of R, and Re» in Fig. 9) over 6-60 rad/sec holding ¢ = 1.2, and 
the IF filter bandwidth varied over 9-90 Hz, holding its gain constant. 
This adjustment is made in 1 dB switched steps (bandwidth ratio = 
1.26). 
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Fig. 9—Receiver third local oscillator PLL circuitry. 


The capacitor Ce in the loop filter rolls the loop off to prevent the 
sum-frequency (650 Hz) phase detector output from modulating the 
vcxo. The diode clamps limit the frequency excursion to +50 Hz. The 
VCXO control line is grounded on external command, and holds the vcxo 
at its resting frequency during signal outages. 


5.5 Fourth LO 


The fourth LO shown in Fig. 10 is self-contained; it generates sideband 
receiver-channel local oscillators, LOUs and LOLs, which include the 
required fourth LO frequency corrections. Because of the low frequencies 
involved and the ease of frequency division using medium scale inte- 
grated circuits (ICs) all fourth LO frequencies are derived by division from 
a single 10.5333-MHz crystal oscillator. The divisions (X n/128) that 
determine the final output frequency ratios are done by digital rate 
multipliers. These rate multipliers produce n output pulses for every 
128 input pulses with n selected by pin connections on the Ics. The n 
output pulses are not evenly spaced because they are produced by gating 
the input pulse train. The uneven spacing is equivalent to a deterministic 
timing jitter on the output waveform which is a fixed multiple of the 
input pulse period. The three decade dividers (+ 1000) following the rate 
multipliers preserve this timing jitter, which is then a much smaller 
fraction (1/1000) of the output waveform period. In effect this + 1000 
reduces the index of the jitter phase-modulation by 1000. The jitter 
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Fig. 10—Receiver fourth local oscillator in control room (crystal oscillator frequency 
is 10. 533333 MHz). 


sidebands on the worst fourth LO output (the 9.052 kHz) are 60 dB below 
the LO signal level; most LO outputs have sidebands >65 or 70 dB below 
the signal level. 


5.6 Frequency acquisition 


Since this receiver is designed for unattended operation, it must be 
capable of automatic frequency reacquisition after signal outages. ‘This 
is accomplished by sweeping the receiver through the expected range 
of signal frequencies whenever tracking cannot be maintained. When 
the signal is located, this sweep stops and the receiver returns to its 
normal tracking mode. 

The receiver’s acquisition speed is increased by making use of the 
signal’s last known frequency, its maximum drift rate, and its maximum 
daily frequency excursion. Let: 


fo = last known frequency at time t, 
= |maximum drift rate| 
fmin = Minimum observed frequency 
fmax = Maximum observed frequency. 
Then the frequency region to be searched is bounded by 
max{fmin, (fo — A(t — to))} S f(t) S minffmax, (fo — A(t — to))}. 
Acquisition speed is also improved for short-duration signal outages by 


holding the last known signal frequency and inhibiting the frequency 
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Fig. 11—Typical acquisition sequence for receiver second local oscillator. Frequency 
tracking is lost at time t,. Reacquisition frequency search begins at point A: signal is 
reacquired at point D. 


sweep until the maximum frequency uncertainty exceeds the pull-in 
range of the discriminator. 

Figure 11 illustrates a typical receiver acquisition sequence. At time 
t. the received signal level fades below the tracking threshold. The 
register controlling receiver frequency is held at its last value f, anda 
counter is counted at a rate A to indicate the maximum frequency un- 
certainty as time progresses. When this uncertainty exceeds the pull-in 
range of the tracking discriminator (point A) the frequency-control 
register is stepped up or down continuously to sweep the receiver fre-_ 
quency. Whenever this register’s excursion from f, equals the maximum 
frequency uncertainty the stepping direction of the frequency control 
register is reversed, reversing the direction of frequency sweep (points 
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B). Sweep direction is also reversed at fmax and fmin, the daily frequency 
excursion limits (point C). These limits are set by thumbwheel switches, 
and are reset periodically to account for long-term aging of beacon and 
receiver oscillators. 

During this frequency sweep the unswitched 19-GHz vertically-po- 
larized channel is observed to detect the presence of the beacon signal. 
When its presence has been verified (point D) the receiver returns to its 
tracking mode. The procedure used to accomplish this verification is 
described below. 

As mentioned previously, the 19-GHz received signal is used for re- 
acquisition. However, the situation is complicated by the 1-kHz polar- 
ization switching. This switching produces sidebands separated from 
the carrier by multiples of the switching frequency. The receiver uses 
knowledge of the relative levels of the carrier and sidebands to dis- 
criminate between the two. 

If the polarization modulation were symmetrical (50 percent duty 
cycle) the received spectrum would appear as in Fig. 12a. Only odd-order 
sidebands are present; the first-order (+1 kHz) sidebands are down 3.9 
dB. This pattern of three signals could be detected using three narrow- 
band filters with 1 kHz spacing to indicate the presence of carrier in the 
center filter. If modulation asymmetry is considered, however, these 
three filters are not sufficient. Figure 12b shows the received spectrum 
for 2 percent modulation asymmetry. This asymmetry has generated 
even-order sidebands about 4 dB below the level of the 17th and 19th 
sidebands; three-filter detection would indicate carrier acquisition at 
these two points. A fourth filter located 2 kHz from the carrier filter, 
however, resolves this ambiguity. Its relative output will be low for a true 
detection and high otherwise. Since the asymmetry of the beacon mod- 
ulation was not known during receiver construction, four filters were used 
to unambiguously detect carrier acquisition. 

This four-filter acquisition detector is shown in Fig. 13. Four nar- 
rowband active filters are driven by the 6.25-kHz signal (derived from 
the 19-GHz V channel) used by the AFC loop. These filters have a two- 
pole response with matched 30-Hz bandwidths, and are tuned to pass 
the carrier at 6.25 kHz, two +1 kHz sidebands, and the —2 kHz sideband. 
These filters also are contained in a temperature-controlled oven. Since 
detection decisions are based on ratios of these signals, the filter outputs 
are envelope detected and logged. Differences of these logged signals then 
indicate the desired ratios. 

The 30-Hz filter bandwidth was chosen as a compromise between 
sweep speed and SNR at the filter outputs. The filter outputs will not 
reach their full levels if the sweep speed exceeds an appreciable fraction 
of filter bandwidth per filter impulse response time. Thus, the maximum 
permissible sweep speed increases with the square of the detection filter 


COMSTAR RECEIVING ELECTRONICS 1313 


o— 


l 

oA 

oO 
| 


RELATIVE LEVEL (dB) 


-—20 — 
e@ee 
-—30 — 
—~40 — 
| | | | | | | | | | | | 
—2 0 2 4 6 8 10 12 14 16 18 20 
SIDEBAND FREQUENCY (KHz) 
(a) 
o— 
a —10— 
2 
a 
WwW 
> 
Lu 
wi 
2 
= 
<x 
a | 
dw eee 
_ | 
—40 — | | | | | | | | | | | | 
| | | | | | | | | | 
—2 (¢) 2 4 6 8 10 12 14 16 18 20 
SIDEBAND FREQUENCY (KHz) 
(b) 


Fig. 12—19-GHz polarization-switching sideband levels, with and without switching 
asymmetry. Note the even-order sidebands present with switching asymmetry. 


bandwidth. The signal level required to achieve a given SNR, however, 
increases linearly with filter bandwidth. A 10-Hz filter bandwidth (that 
used in the receiver’s 19-GHz signal channels) was found to require a 
sweep speed of <50 Hz/sec for reliable detection. With the 30-Hz filters 
used, the sweep speed may be increased to 250 Hz/sec, speeding the re- 
acquisition process. 

Several conditions must be satisfied to indicate the presence of re- 
ceived carrier in the 6.25 kHz carrier filter. The carrier filter output must 
be >10 dB above its no-signal level to assure a reasonable false-alarm 
rate. Both +1 kHz sideband filter outputs must be between 2 and 6 dB 
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_ Fig. 13—Four-filter acquisition detector (part of receiver second local oscillator AFC 
circuitry). 


down from the carrier level. This range allows for slight nonlinearities 
in the log converter slopes and the effect of noise at low signal levels. 
Finally, the —2 kHz sideband filter output must be either <10 dB above 
its noise level or >20 dB below the carrier filter output. If all these con- 
ditions are satisfied a four-channel detection signal is given. This action 
interrupts the acquisition frequency sweep and turns on the AFC to at- 
tempt to track the signal. The four-channel detection signal is observed 
for 15 seconds to verify the detection. If the detector output is true for 
>50 percent of this time, a valid signal is assumed to be present and the 
receiver returns to its tracking mode. Otherwise, the acquisition search 
is continued until the signal is found. 

A second signal is generated in conjunction with the four-channel 
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Fig. 14—1-kHz polarization switch synchronizing circuits. 


detection output to further guard against tracking of a 19-GHz sideband. 
An error signal is generated if the carrier filter output is >20 dB above 
its noise level but no four-channel detection has been made. (Such a 
condition would occur if the receiver were tracking a sideband.) If this 
error signal persists for 10 seconds a frequency sweep is initiated over 
the full uncertainty band. This action should never be performed, but 
further assures the eventual acquisition of only the 19 GHz carrier.* 


VI. POLARIZATION SWITCH SYNCHRONIZATION 


A separate IF channel without switches is provided for synchronizing 
the main-channel polarization switches (see Fig. 1) with the 1-kHz po- 
larization-switched beacon signals. Since the clear-air signal-to-noise 
ratio in a 2-kHz bandwidth is <+30 dB and low jitter (<5 percent of a 
switching period) switching is desired for rain attenuation of at least 40 
dB, a very long (~100 sec) time constant (narrowband) PLL is required 
to recover the 1-kHz switching signal. Such a long time constant loop 
has an even longer pull-in time; therefore, frequency and phase memory 
through severe rain attenuation events (>40 or 50 dB) on the order of 
10 minutes is needed to prevent loss of data for long periods after such 
severe events. The long PLL time constants and even longer memory are 
easiest to implement digitally. 

The polarization switch synchronizing circuits in Fig. 14 follow the 
1 GHz to 2 MHz frequency conversion in the acquisition and polarization 
switch synchronization channel in Fig. 1. The 1-kHz digital PLL is shown © 
in more detail in Fig. 15. 

As indicated in Fig. 14, the automatic gain controlled (AGC) 2-MHz 
IF amplifier holds the 1-kHz signal into the oven-enclosed 1-Hz BW active 
BPF constant over the range from the clear air signal level to the noise 
level of the 1-Hz BW BPF input. The filtered 1 kHz is limited, fed to the 
PLL, and is detected to provide a threshold comparison voltage. The 

noise voltage in a 50-Hz Bw at 1.2 kHz is rectified and lowpass-filtered 
for the threshold detector reference. 

The square-wave PLL output is phase-locked to the satellite polar- 
ization switch. However, because of filter delays, etc., the square-wave 
transitions and the beacon signal transitions at the receiver switches do 
not occur at the same instants of time. Delay and symmetry adjustments 

_after the PLL permit adjustment of the switching waveform transitions 
for coincidence with the signal transitions. Further shaping of the switch 
driving waveform for the 2-MHz third IF polarization switches (Fig. 1) 
provides a “dead zone” around transitions to allow switching transients 
to settle. 


* This circuitry was later disabled since it was often triggered by strong depolarization 
produced by atmospheric ice crystals.!” 
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Fig. 15—1-kHz digital phase-locked loop (PLL). 


6.1 Digital phase-locked loop 


The digital PLL in Fig. 15 is equivalent to a conventional second order 
loop? with an active loop filter. The frequency of a 1-MHz voltage con- 
trolled crystal oscillator (VCXO) is controlled by the loop amplifier output 
voltage. The 1 MHz is divided in digital decade dividers to 1 kHz and 
compared in phase with the limited 1 kHz derived from the polariza- 

‘tion-switched beacon signal. The notch filter following the analog- 

multiplier phase-detector prevents the 2-kHz sum frequency from sat- 
urating the loop amplifier. The straight-through path from the notch 
filter to the summer (through the switch and resistor described later) 
establishes the high-frequency open-loop gain. The digital capacitor 
output is summed with the straight-through path to determine the 
open-loop low-frequency response. 

Operation of the digital capacitor centers on the up-down counter. A 
fixed clock frequency is divided by two and applied to the count up input 
of the counter. Input pulses are blocked by the inhibit gates when a 
control voltage appears on the inhibit control line; otherwise, all input 
pulses are passed on to the output. The analog-to-digital converter (ADC) 
samples the phase detector output every 10 clock cycles and converts 
it to a 10-bit digital word, M, that serves as the control word for a digital 
rate multiplier. With an ADC input of 0, M = 512, and the rate multiplier 
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passes 512/1024 = 1/2 of the input pulses to the count down input of the 
counter. Under this condition, the counter alternately counts up and 
down so there is no net change in count. The pulses into the count up and 
count down inputs and the convert pulses to the ADC are appropriately 
timed to prevent ambiguities that would arise if these pulses were nearly 
coincident. Positive phase detector outputs produce M > 512. With M 
> 512 there are more down counts than up counts in a given time period 
and the counter counts down at a rate proportional to the phase error. 
With a negative phase error, M < 512, there are more up counts than 
down, and the counter counts up. The count in the counter is analogous 
to charge stored in a capacitor; M is analogous to current into the ca- 
pacitor; and the ADC is equivalent to a voltage to current converter. The 
analogy is completed by the digital-to-analog converter that converts 
the counter count to a voltage, the capacitor voltage, with 0 count being 
Maximum positive voltage, maximum count being maximum negative 
voltage and mid-count being 0. The equivalent capacitance is a function 
of the clock rate, the ADC voltage-to-M factor and the DAC count-to- 
voltage factor. The counter saturates at maximum count and at 0 count 
so the contents are not “spilled” by either a continuous positive or neg- 
ative phase error. The DAC output is filtered to smooth the voltage 
steps. 

In the normal run mode of the PLL, the digital capacitor clock fre- 
quency is 35 Hz, the natural frequency, w,, of the PLL is 6 X 1073 and 
the damping is 0.5. A faster loop response is available for faster acqui- 
sition (acq) of phase lock with high signal levels. This is accomplished 
by increasing the clock frequency to 15 kHz to decrease “capacitance” 
and by decreasing the loop summing resistance to increase the high- 
frequency open-loop gain and thus maintain the damping at 0.5. The 
fast acquisition mode can be selected manually with a single switch. The 
digital capacitor departs from a real capacitor when capacitance is 
changed by changing clock frequency. Changing clock frequency does 
not instantly change the counter count; it changes only the rate of count. 
Thus, digital capacitor voltage is conserved with capacitance change and 
“charge” is not conserved. This is a useful property in this PLL because 
the VCXO control voltage and thus the vCxO frequency and phase remain 
continuous when the loop natural frequency is changed discontinuously. 
The PLL then remains in lock and does not experience a phase step in 
going from the fast acquisition mode to the run mode. 

Phase and frequency memory during loss of signal or loss of primary 
power is provided by the up-down counter, loop amplifier, VCXO, and 
+1000. These components are supplied from a battery power supply that 
normally floats on a charger. 

For loss of power all inputs to the up-down counter are inhibited to 
prevent start-up transients from disrupting the held count. 
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For loss of signal, counter memory is provided by inhibiting the clock. 
Continued operation at the average phase and frequency before the hold 
_is insured by grounding the summing input from the phase detector. The 
VCXO is in an oven to insure that the stability of the held phase is better 
than the phase stability of the polarization switching oscillator in the 
satellite. 


Vil. DATA COLLECTION 


The data collection equipment is common to all receiving channels 
as indicated in Fig. 16. Data that are critical for maintaining continuity 
in the data base for long term statistics, e.g., signal attenuation and de- 
polarization, are recorded continuously on analog ink-pen paper-chart 
recorders. These chart recordings provide a backup in the event of failure 
of the digital recording system and also provide a “quick look” at the 
recorded data. The logarithms of signal amplitudes are recorded on the 
chart recorders with a range of 50 dB. 

All receiver outputs are multiplexed along with (i) system status in- 
dicators such as whether the frequency control loop is tracking or holding 
in a signal fade, (iz) outputs from weather instruments such as rain 
gauges, thermometers and wind speed recorders, (iti) outputs from the 
on-going interim experiment!® using the COMSTAR beacon at 128°W, 
and (iv) another propagation experiment!! using a 12-GHz beacon on 

_the NASA/Canadian Communications Technology Satellite (CTS). These 
multiplexed signals are digitized, temporarily stored in the minicomputer 
core memory, screened for relevance, and stored on digital magnetic tape. 
Multiplexer and analog-to-digital converter sequencing, digital data 
screening and buffering, and digital tape drive control are handled by 
the same minicomputer that points the receiving antenna. 

The objectives of the data screening procedure are to minimize the 
amount of superfluous data stored while not losing any relevant prop- 
agation data. The screening algorithm copes with the multiplicative 
signal fluctuations caused by the atmosphere and with the additive noise 
that dominates at low signal level. 

The data taking and screening procedure is outlined in the simplified 
flow chart in Fig. 17. Four times a second all receiver outputs are digitized 
and temporarily stored as a sample set (boxes 2 and 3). Each entry j in 
the sample set is checked for a large impulse change by testing for 


|sample value; — old mean value;| < kijos; 


where the old mean value is the stored mean from the preceding time 
interval, k;; is a scaling factor and o,; is the expected standard deviation 
for an individual sample in entry j. For all amplitude entries 


G,; lamoliude = daj + (old mean value;) . Cn; 
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Fig. 16—Data collection equipment. 


where oq; accounts for additive noise and C,,; weighted by the old mean 
value accounts for the multiplicative atmospheric effects. For phase 
difference entries, the multiplicative factor is inversely scaled by the old 
amplitude mean of the weakest signal in the phase difference pair: 


05; | phase = Faj + Cmj/(old amplitude mean). 


If an impulse, i.e., a large change in the data, has not occurred, the entries 
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Fig. 17—F low chart for real-time data screening software. 





SAMPLE SET 
TO BUFFER 


in the sample set are used to update a set of new mean values being ac- 
cumulated (boxes 4 and 5). The set of new mean values is then tested for 
change by 


|new mean value; — old mean value;| < kpjosj/VN (6) 


where k,,; is a different (smaller) scaling constant and N is the current 
number of samples included in the new mean (box 6). The sample 
standard deviation o;; is scaled by WN for this test since the standard 
deviation of the statistical fluctuations of the mean decreases by this 
factor (assuming independent samples). This second test quickly detects 
gradual signal changes. If a change is not detected and if the new mean 
has been accumulating for less than 1 minute, i.e., N < 240, then the data 
taking program waits for the next clock pulse (box 7). If a change is de- 
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tected by either of the above tests or if N = 240 then the new mean values 
are transferred to the next unused position in an output buffer (box 8), 
the old mean values are set equal to the new mean values (box 9) and the 
new mean values are initialized (box 10). If 20 entries of sets of values 
have been made into the output buffer, the buffer is full and it is written 
onto magnetic tape (boxes 11 and 12). If this pass through the procedure 
is not from the impulse test or is the second pass of the first impulse 
detected, the program waits for the next clock pulse (box 13). If this is 
the first pass from a detected impulse, however, the entire set of samples 
is transferred to the output buffer and the old mean values are set equal 
to the sample set (boxes 14 and 15). Thus, when an impulse occurs, both 
the previous running mean and the sample set containing the impulse 
are stored. Data screening then proceeds normally. 

Each set of samples or means recorded also contains the time it was 
recorded:so the time interval spanned is available for off-line data re- 
duction. The data screening software includes provisions for: (i) stopping 
and starting data collection, (ii) recovering from primary power inter- 
ruption, and (iii) recording calibration signals on the data tapes. 


Vill. CALIBRATION SOURCE 


The operation of the entire receiving system may be verified using a 
19- and 28-GHz signal source with RF characteristic closely duplicating 
those of the beacon itself. Knowledge of the signal source amplitude also 
allows determination of the absolute received signal level. This source 
was extremely useful during prelaunch receiver construction and check 
out. 

A block diagram of the calibration source (beacon simulator) is shown 
in Fig. 18. A 66.11-MHz crystal oscillator drives a multiplier chain, 
producing three output frequencies: 264.44 MHz, 19.04 GHz, and 28.56 

GHz. Samples of the 19- and 28-GHz signals are made available to in- 
dicate the power output of the source. The 28-GHz signal is attenuated 
and phase-modulated by either the 264-MHz multiplier output or that 
output doubled to 529 MHz. The modulation index is set to produce 
sideband levels similar to those generated by the satellite. A precision 
calibrated variable attenuator allows level control from near clear-air 
levels to the receiver noise floor. A cross-polarized signal component is 
obtained from a 20-dB directional coupler. 

The 19-GHz multiplier output is attenuated through fixed and vari- 
able attenuators. A cross-polarized component is generated with a 30-dB 
coupler. Both direct and cross-polarized signals are power-split using 
3-dB quadrature hybrids. The two 19-GHz outputs are switched alter- 
nately between a “direct” and a “cross-polarized” hybrid output using 
PIN diode switches. A 1-MHz crystal oscillator and +1000 frequency 
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Fig. 18—Receiver calibration source (beacon simulator). 
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divider switch these switches at a 1-KHz rate. Direct and “cross-polar- 
ized” signals then appear alternately at the two 19-GHz outputs. 

All four source outputs are permanently connected to their respective 
receiver mixer inputs through directional couplers so that the source 
signals can be injected with no waveguide switching. The source outputs 
are disabled by turning off power amplifiers in the frequency multi- 
plier. 

Since the calibration source operates in close proximity to the antenna 
feed frame, care was taken to hold any RF leakage from the source to 
<-—150 dBm. Coupled with the directivity of the antenna feed assembly, 
this assures precise calibration signal levels down to the system noise 
floor. 


IX. RECEIVER PERFORMANCE 


The entire receiver has met all original performance requirements. 
Care in the design of individual assemblies has allowed integration of 
the entire receiving system with extremely minimal interaction or other 
problems. The receiver has been extremely reliable; during more than 
one year of operation, no significant data has been lost due to receiver 
failures. More detailed observations of actual receiver performance are 
given below. 

Overall receiver linearity is nearly perfect. Figure 19 shows the de- 
viation from perfect linearity of a typical receiver output (19-GHz ver- 
tical copol) as a function of input signal level. Other receiver outputs 
exhibit similar results. The calibration source was used to obtain these 
results; for the upper 20 dB of input level the source was temporarily 
connected directly to the mixer inputs. The input signal level is nor- 
malized to the clear-air satellite input level. The receiver is seen to be 
linear within +0.4 dB for all expected input levels down to 55 dB at- 
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tenuation. Below this level the measurement becomes noise-dominated. 
The 1-dB gain compression point occurs 6.5 dB above the clear-air input 
level. The receiver exhibits negligible differential phase shift with 
variations of signal amplitude. As an example, the 19-GHz copolarized 
differential phase is constant within 0.1° from 4 dB above the clear-air 
input level to the 40-dB attenuation level. Two degrees of phase error 
is observed at the 50-dB attenuation level. 

The receiver has shown excellent long-term gain stability. Measure- 
ments over a 4-month period have shown a receiver output change of less 
than 0.2 dB. This figure includes both receiver and antenna gain changes 
and satellite power fluctuations. 

Long-term measurements of 19 GHz copolarized differential phase 
are stable to within +2°. This indicates the overall phase stability of both 
the satellite and receiver antennas and much of the receiving electronics, 
including all narrowband filters. Early measurements on the antenna 
feed frame* showed a temperature coefficient of <0.2°/C° at 19 GHz. 
This has since been improved through equalization of waveguide 
lengths. 

The 2-MHz crystal filters have a temperature coefficient of <0.16°/C°. 
The two critical 19-GHz V and H copolarized filters were chosen for 
matched temperature coefficients, and track within +0.2° over 15 to 
40°C. The air-conditioned building environment maintains phase 
tracking on all other channels within +1°. 

The third and fourth IF active bandpass filters are in temperature- 
controlled ovens and are unaffected by ambient temperature variations. 
These filters were bandwidth-matched to within +1 percent, yielding 
a frequency-induced phase error of <0.5° for static frequency offsets of 
up to 5 Hz. 

As stated previously, the receiver noise floor is set by front end noise; 
all following amplifiers and frequency conversions contribute <0.1 dB 
to the overall system noise level. 

The polarization switch synchronization circuitry must accurately 
track the 1-kHz polarization switching phase to prevent leakage of co- 
polarized signals into receiver crosspolarized channels. This circuitry 
exhibits £5° phase jitter during a 40-dB fade and holds phase open-loop 
for greater attenuation. A dead band of +9° is sufficient to maintain 
open-loop synchronization for at least 10 minutes. 

The AFC loop will reliably track the satellite frequency excursions 
during a 51 dB fade at 19 GHz (copol SNR = 9 dB). Below this level, the 
loop initiates its reacquisition sequence. Initial four-channel detection 
is accomplished with 50 percent probability at the 49 dB fade level, while 
50 percent final verification probability is reached at the 47 dB fade 
level. 

An example of data obtained during a typical rain event is shown in 
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Fig. 20—Typical receiver outputs from data screening software. 


Fig. 20. Only 9 of the 25 data channels recorded are shown here. The 
display was generated by software used for off-line data screening. The 
ticks along the x-axis delimit records containing 20 sets of data points 
(see Section VII on data collection) and indicate the increase in data- 
recording rate during such events. 

The 4 upper traces display 19-GHz vertical copolarized (A19V) and 
cross-polarized (19XV) signal levels, vertical cross-polarization phase 
(@19V-XV), and copolarization differential phase (¢19V-H). The 
cross-polarization phase zero reference was obtained by rotating the 
antenna feed frame slightly to leak a copolarized component into the 
cross-polarization channels. 

The next four traces display 28-GHz vertical copolarized carrier 
(A28VC), lower sideband (A28VL), and cross-polarized signal level 


COMSTAR RECEIVING ELECTRONICS 1327 


(A28XV), and lower sideband differential phase (6¢28VC-L). The close 
agreement of carrier and sideband amplitudes and lack of relative phase 
shift indicate little medium dispersion over the 264-MHz spacing. The 
phase fluctuations observed at low signal amplitudes are due to the low 
measurement SNR, and are not medium effects. 

The final trace displays the vertical copolarized output of the colocated 
interim receiver!° observing the COMSTAR beacon at 128°W longitude. 
The differences between this and the first trace result from the different 
ray paths taken through the rain. 


X. SUMMARY 


This paper has presented the design requirements for the Bell Lab- 
oratories Crawford Hill 19- and 28-GHz COMSTAR beacon receiving 
electronics, and has described the hardware and techniques used to 
achieve these requirements. The receiver operates unattended, contin- 
uously collecting data to determine attenuation, depolarization, dif- 
ferential phase shift, dispersion and angular scatter produced by pre- 
cipitation. The coherence of the various transmitted beacon signals al- 
lows use of receiver noise bandwidths as narrow as 1.6 Hz to determine 
depolarization during severe fading. The receiving system meets all the 
design requirements, and has already collected data reliably for over one 
year. 
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The 19-GHz Receiving System for an Interim 
COMSTAR Beacon Propagation Experiment at 
Crawford Hill 
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This paper describes the antenna and receiving electronics for the 
Bell Laboratories Crawford Hill 19-GHz COMSTAR Interim Experiment. 
This experiment has collected essentially continuous 19-GHz atten- 
uation data on the earth-space path to the COMSTAR A satellite since 
May 25, 1976. The receiver operates unattended, automatically reac- 
quiring the beacon signals after deep rain-induced signal fades. A re- 
ceiver bandwidth of 10 Hz allows accurate measurement of fade depth 
to the 40-dB level. 


1. INTRODUCTION 


A receiving system for the 19-GHz COMSTAR beacons! has been 
operating since May 25, 1976, at the Bell Laboratories Crawford Hill 
facility at Holmdel, New Jersey. This system is less complex than the 
precision receiving system** also operating there, but uses the same 
overall system design and many similar components. This paper will 
discuss the antenna and receiving electronics for this system. 

This receiving system was used initially for observations of the first 
COMSTAR beacon as it crossed the horizon at Crawford Hill. The main 
receiving system electronics® were operated for a short period in parallel 
with this system, while awaiting completion of the Crawford Hill 
millimeter-wave antenna. 

The interim receiving system is presently observing the beacon located 
at 128°W longitude. The propagation path from Crawford Hill to the 
beacon has an azimuth of 244.7° and an elevation of 18.5°. The beacon 
characteristics are similar to those given in Table I of Ref. 1; the incident 
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polarizations are within 5° of vertical and horizontal at Crawford Hill. 
Beacon observations have been nearly continuous since the start of the 
experiment. Much useful information has already been obtained and 
has been reported elsewhere.®-§:7 Since the main receiving system is 
observing the satellite at 95° W longitude, comparison of the two receiver 
outputs should indicate the efficacy of “satellite diversity” from a 
common earth terminal. 

Future plans for the interim receiving system include its use as a re- 
mote space diversity site. This will allow accumulation of joint fading 
statistics to fade depths not achievable in previous radiometer experi- 
ments.® 

In its basic configuration, this receiver records the amplitudes of two 
orthogonal components of the 19-GHz incident radiation. Two identical 
receiver channels are used. Narrow receiver noise bandwidths are used 
to maximize the measuring range. Automatic frequency tracking and 
reacquisition allow unattended operation. The receiving antenna 
beamwidth is sufficiently wide that no tracking of diurnal satellite mo- 
tion is required. 

The general design considerations for this receiving system are similar 
to those discussed for the main receiving system.® The receiving antenna 
is described in Section II. Section III discusses the receiving electronics. 
Receiver performance and some typical data are included in Section 
IV. 


il. ANTENNA 


The antenna and feed assembly were originally designed for a 20-GHz 
propagation experiment using a beacon on the ATS-6 satellite.? The feed 
assembly and antenna positioner were later modified for this experiment. 
The antenna is of Cassegrainian design, with a 12-foot-diameter aperture 
and two orthogonal linearly polarized feeds. Antenna positioning is 
controlled remotely from the equipment building. The complete antenna 
assembly is shown in Fig. 1. 

The 12-foot-diameter spun-aluminum main reflector was manufac- 
tured for nominal use at C and X bands. After tensioning the reflector 
to correct a surface warpage, however, good performance was obtained 
at 20 GHz. 

The 16-inch hyperboloidal subreflector is supported by four aluminum 
I beams, thinned for minimum depolarization. Both subreflector and 
support had nonessential material milled away to minimize weight. 

Polarization diplexing is accomplished using a quasioptical polar- 
ization separator, as in the main antenna feed assembly.4 This technique 
is shown in Fig. 2. The polarization separator is a grid of parallel copper 
strips on a thin mylar support membrane. The grid is oriented so that 
one polarization, B (polarized parallel to the page), is transmitted 
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Fig. 1—View of 12-foot antenna for COMSTAR interim experiment. 


through the polarizer to aperture I. The orthogonal polarization, A, is 
reflected to aperture II. 

Short-focal-length paraboloids at apertures I and II reflect the received 
energy to dual-mode feedhorns. The feedhorn size produces a 20-dB edge 
illumination taper at the main reflector. The feed assembly is rotatable 
from the equipment building to allow alignment of the feed with the 
incident polarization angle. 
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Fig. 2—Polarization separator geometry for 12-foot antenna. 


The antenna is mounted on a modified Nike-Hercules elevation- 
over-azimuth positioner. New drive motors and position encoders allow 
positioning the antenna from within the equipment building to 0.01 
degree precision. Since the satellite position is controlled to within 0.1 
degree, continuous position tracking is not required. 

The feed assembly and mylar rain window are shrouded with fiberglass 
weather covers. These are covered with reflective aluminum foil to 
minimize the “greenhouse” effect. Electric heaters and forced ventilation 
regulate the temperature within the enclosure. 


ll. RECEIVING ELECTRONICS 


The electronics for the interim receiver are similar to those used in 
the main COMSTAR beacon receiver.® This section will rely heavily on 
the description of the main receiver electronics in this issue. The basic 
design philosophy and rationale behind the choice of IF frequencies, etc., 
is covered there. 

A block diagram of the interim receiver electronics is shown in Fig. 
3. The receiver consists of two unswitched 19-GHz receiver channels and 
somewhat simplified frequency control equipment. The first frequency 
conversion is performed at the antenna feed. All other equipment is lo- 
cated in a building alongside the antenna. 

Throughout the receiver, care was taken to assure >60 dB overall 
isolation between receiver channels. Liberal component shielding was 
used, and isolators were used where necessary to avoid coupling through 
common local oscillator lines. 

The two antenna feed outputs are down-converted to 1.003 GHz by 
Schottky-diode mixer-preamplifiers with 6.5-dB single-sideband noise 
figure. The 18.037-GHz first LO is generated at the feeds from an oscil- 
lator in the support building. 
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The two IF signals are fed to the support building through coaxial cable 
and are filtered by 0.8 GHz BW bandpass filters to avoid noise saturation 
of the following wideband IF stages. Down-conversion to 2.067 MHz is 
performed with image rejection mixers, which use phase cancellation 
to suppress image noise by >20 dB. Coarse frequency tracking is per- 
formed at this conversion, as will be discussed later. 

After further amplification, the system noise bandwidth is further 
constrained with 5-MHz low-pass filters. Step attenuators set the 
clear-air signals to the desired levels. Amplifiers are used to isolate 6-kHz 
BW crystal bandpass filters, which provide image rejection at the next 
conversion. 

Balanced mixers perform the next frequency conversion to 6.25 kHz. 
Short-term frequency instabilities are removed at this step with a 
phase-locked loop, whose operation will be described later. The 6.25 kHz 
signals are amplified and filtered by 250 Hz BW active bandpass filters. 
These and the following filters are mounted in temperature-stabilized 
ovens for improved gain and frequency stability. 

The final frequency conversion to 325 Hz is performed by an active 
linear multiplier. The final predetection bandwidth is set by 10-Hz BW 
active bandpass filters. These filters exhibit a single-pole response with 
16-Hz noise bandwidth. These filters strip off the 1-kHz polarization- 
switching modulation and pass only the carrier frequency. 

Signal amplitudes are determined with linear amplitude detectors 
exhibiting +0.1 dB linearity over 60-dB signal range. The detector 
outputs are processed by dc logarithmic amplifiers for better display 
resolution during deep rain fades. The two log amplifier outputs are 
recorded on a paper chart recorder operating at 4 inches/hour. These 
two outputs are also fed to the main receiver data recording equipment 
over telephone lines, using voltage/frequency and frequency/voltage 
converters. Log, rather than linear, recorder outputs are used to avoid 
dc offset problems at low signal levels. 

Since a measurement of differential amplitude between the two re- 
ceived polarizations is desired, the two log amplifier outputs are sub- 
tracted and this difference recorded, with higher sensitivity, on the chart 
recorder. In addition, 1- and 10-minute timing markers are recorded from 
the main receiver to allow time synchronization of the two systems. 

The receiver must track both the long- and short-term beacon fre- 
quency fluctuations. A digitally controlled AFC loop is used to track the 
thermally induced diurnal fluctuations. This loop is illustrated in Fig. 
9 of Ref. 3. A sample of the 2-MHz vertically polarized signal is down- 
converted to 6.25 kHz and fed to a narrowband discriminator made from 
two stagger-tuned active filters. The discriminator output is integrated 
for 1 second. If the average frequency error exceeds 2 Hz, the output 
frequency of a digitally controlled frequency synthesizer is incremented 
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Fig. 3—Block diagram of COMSTAR interim receiver. 


or decremented by 1 Hz. The synthesizer output is doubled and used to 
make up the second local oscillator. Thus, frequency corrections are 
made in increments of 2 Hz at 1 Hz rate. The loop will track a frequency 
excursion of +2 Hz/sec, greater than that expected from the bea- 
con.1}:3 

If the signal at the vertically polarized receiver output falls below a 
preset threshold, the AFC loop will be unable to maintain track and so 
initiates an ever-expanding search around the last known beacon fre- 
quency. When the signal is again detected at the receiver output, this 
search ceases and tracking resumes. This technique does not prohibit 
acquisition of a polarization-switching sideband, but performs ade- 
quately and is much simpler to implement than the technique used in 
the main receiver. 

Short-term frequency fluctuations are tracked with a phase-locked 
loop (PLL). This loop locks the unfiltered 325-Hz vertically polarized 
received signal to a stable 325-Hz oscillator through adjustment of the 
2-MHz third conversion oscillator. Since both vertical and horizontal 
signals are phase-coherent, the horizontally polarized signal will be 
locked as well. This loop has a 30-Hz bandwidth and is described in 
greater detail in Ref. 3. 


IV. RECEIVER PERFORMANCE 


The interim experiment has operated essentially continuously since 
May 25, 1976, and has met its design objective of collecting continuous 
19-GHz amplitude statistics from the COMSTAR A satellite. Receiver 
failures have been minimal, and the conservative design approach taken 
allowed integration of the entire receiving system with no unexpected 
interactions between subassemblies. 

Since most subassemblies for this receiver are identical to those used 
in the main receiver, niost performance measures are identical for the 
two. Linearity and long-term stability are discussed in Section IX of 
Ref. 3. 

Since this receiver operates without polarization switching using a 
smaller antenna aperture, the measured clear-air SNR is 50 dB. The AFC 
loop threshold is set at the 40-dB fade level; below this level the AFC 
initiates a frequency search to attempt to reacquire the beacon signal. 
Reacquisition is accomplished reliably to 11 dB SNR, corresponding to 
a fade depth of 39 dB. 

Since the receiving antenna does not track the satellite motion, small 
diurnal amplitude variations are observed as the satellite traverses the 
antenna directivity pattern. These variations are generally less than 
1 dB peak-to-peak. Since they are of long duration and repeatable on 
a day-to-day basis, they pose little problem during data reduction. 
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Fig. 4—Typical data obtained from COMSTAR interim experiment (Hurricane Belle, 
August 9, 1976). 


An example of the data collected by this experiment is shown in Fig. 
4, These data were taken August 8-9, 1976, during the passage of hur- 
ricane Belle 50-60 miles east of Crawford Hill. Time, indicated in GMT, 
runs from right to left. The upper and lower traces indicate the vertically 
and horizontally polarized received signal strengths and are calibrated 
in dB attenuation from the clear-air signal level. While this event is 
clearly not a typical one, the data shown indicate excellent receiver 
performance during periods of great environmental stress. 


V. SUMMARY 


This paper has presented a brief description of the antenna and re- 
ceiving electronics for the Bell Laboratories Crawford Hill 19-GHz 
COMSTAR Interim Experiment. This equipment has collected essentially 
continuous amplitude data from the COMSTAR A satellite since it first 
became visible at the horizon on May 25, 1976. A received bandwidth 
of 10 Hz allows accurate measurement of fade depth to 40-dB level. 
These data, together with those collected by the main Crawford Hill 
COMSTAR propagation experiment, will be used to characterize earth- 
satellite propagation at 19-GHz. 
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COMSTAR Beacon Receiver 
Diversity Experiment 


By N. F. DINN and G. A. ZIMMERMAN 
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The design and realization of 19-GHz and 29-GHz beacon receivers 
for implementation of the remote site diversity reception experiment 
are discussed. The experiment objectives and constraints are investi- 
gated in terms of their impact on equipment realization. Data acqui- 
sition and retrieval problems associated with remote sites are also 
addressed. Finally, some of the results obtained from early operation 
are presented. These results, obtained from direct measurement of the 
beacons, correlate very well with earlier radiometer measurements 
scaled in frequency with appropriate corrections made for the impact 
of energy scattering due to rain. 


I. INTRODUCTION 


While most current-generation communication satellites operate in 
the same common carrier bands (4 and 6 GHz) as do terrestrial facilities, 
operation at significantly higher frequencies—such as the 12- to 14-GHz 
band, and the 18-, 30-GHz bands—offers a number of important ad- 
vantages. Among these are expected reduction in interference, reduced 
spacecraft component sizes, and higher gain spacecraft antenna with 
the opportunity for independent multiple beams within the continental 
United States (CONUS). 

On the other side of the ledger, radiation at these frequencies is more 
effectively scattered and attenuated by water droplets, thus threatening 
system operation with attenuation, depolarization and dispersion effects. 
These problems are under empirical investigation utilizing beacon 
sources carried aboard the COMSTAR satellites. This new opportunity 
follows a fruitful (but limited) period of measurements with radiome- 
ters.1.2.3 
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Where uncertainties in radiometer results existed in the past, due to 
their range limitation and their inability to account for signal loss due 
to scattering, they can now be overcome using the beacons. Thus the 
objectives of this new experimental phase are manifold but they can be 
summarized as follows: 

(i) Directly obtain continuous, long-term attenuation measurements 
at (the higher) system frequencies (thus accounting for both absorption 
and scattering). 

(ii) Increase the dynamic fade measurement range to at least 30 
dB. 

(iit) Provide for direct comparison of radiometer and beacon mea- 
surements obtained simultaneously from the same antenna, thus al- 
lowing qualification of radiometer data in hand. 

(tv) Extend the available site diversity performance data base by 
operating in different meteorological environments. 

Satisfaction of these objectives constrained the design of our receiving 
stations. In particular, the need for reliable continuous operation requires 
a capability not only for monitoring the system remotely but also for 
retrieving data to a central site. The system must provide for unmanned 
operation over extended periods without loss of data. Finally, the rec- 
ognition that test sites would be distant (and costly to visit) required that 
the experiment be implemented to be self-contained and transportable 
with minimum expense and delays, that equipment failures be very in- 
frequent, and that simple failures not compromise all observations. 
Consequently, conservative design approaches were used throughout, 
and provisions were made for operation alarming. The method used to 
achieve the experimental objectives is discussed in subsequent sec- 
tions. 

Section II presents a discussion of the primary factors which shaped 
the experimental setup, including such things as anticipated signal- 
to-noise ratios, the importance of remote test sites, and cost. This is 
followed in Section III by a detailed description of the data acquisition 
requirements and their realization. Section IV begins the discussion of 
the receiver, starting with the preliminary operating objectives. The 
overall receiver realization is presented in some detail in Section V and 
includes discussion of the antenna with its auxiliary equipment, the 
scanning receiver, the 19-GHz receiver, the 28.5-GHz receiver, and the 
frequency predictor which compensates for satellite drift in the absence 
of beacon frequency update information. Results obtained during the 
initial experiment phases are summarized in Section VI. 

The Phase I experimental sites are located at Grant Park, Illinois (near 
Chicago), and Palmetto, Georgia (near Atlanta). Each principal site is 
equipped with beacon receivers (19 and 29 GHz) and a 13-GHz radi- 
ometer. Associated with these sites are remote radiometer sites which 
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Fig. 1—Beacon/radiometer sites. 


provide data for diversity performance evaluation. Figure 1 indicates 
Manteno, Illinois, approximately 20 miles west of Grant Park, and 
Temple, Georgia, approximately 20 miles northwest of Palmetto. 


ll. PRIMARY EXPERIMENTAL CONSIDERATIONS 

Six factors were of prime importance to the design of the beacon re- 
ception equipment: 

(i) The expected power level of the received signals and the required 


dynamic measurement range. 
(ii) The spectral width, maximum frequency variations and the rate 


of drift of the beacon oscillators. 
(tii) The anticipated satellite stationkeeping excursions. 
(tv) The requirement for remote unattended and continuously op- 


erative stations. 
(v) The need for coordinated diversity site radiometer data acqui- 


sition. 
i ce aime 29 + 
(vi) “Minimum” cost procurement and operation. 


2.1 Preliminary signal-to-noise ratio calculations 


The beacon signal levels and CONUS coverage are detailed in Ref. 4; 
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the relevant parameters are summarized below: 


19 GHz 29 GHz... 
EIRP 53 dBm 56 dBm 
Polarization loss* 3 dB — 
Path loss 210 dB 213 dB 
Fade range ~35 dB ~38 dB 
Min. signal level <-—195 dBm <-—198 dBm 
+ Gat + Gat 
Down converter + 
preamp noise figure 6.5 dB 6.5 dB 


The corresponding noise power in a 100-Hz! bandwidth is readily cal- 
culated: 


P, = Nr + 10 log KTB, + 30 dBm 
= 6.5 — 174+ 10 log 100 dBm 
P, = —147.5 dBm 


Allowing a few dB loss of sensitivity due to satellite stationkeeping 
variations, antenna misalignment, network loss, etc., implies that 
achievement of a usable signal-to-noise ratio during deep fades requires 
an antenna gain of 50 dB or more. In addition, to limit the noise power, 
it implies a final processing bandwidth significantly less than 100 
Hz—which therefore requires the receiver to track the beacon frequency, 
maintaining lock during fades of at least 35 dB. 


2.2 Beacon frequency variation 


The beacon reference oscillator frequency exhibits a frequency vari- 
ation less than +1 part in 106 on a diurnal basis, and +1 part in 10® per 
year due to aging. In addition the receiver oscillator has been allocated 
+1 part in 10° per year for aging, thus implying a total potential annual 
variation to be accommodated of +3 parts in 10° (approximately +57 
kHz at 19 GHz). Consequently, the receiver capture and tracking range 
was specified to be 100 kHz. 

In addition to maximum frequency excursion considerations, the 
short-term rate of change of frequency must be accommodated. This is 
a major complication since receiver sensitivity must be obtained by se- 
vere band limiting. The receiver configuration chosen utilizes parallel 


* Only one polarization is processed. 

1 G4, the antenna gain, is determined in Section 2.3. 

t The beacon design objective was to provide 90 percent of the signal power in a band- 
width of 100 Hz at 19 GHz, or within.150-Hz bandwidth 29 GHz. 
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processing: a set of 32 narrowband comb-filters for selectivity, and an 
AFC loop to center the comb set about the instantaneous received fre- 
quency. The 32-filter comb set in the 19-GHz receiver spans a total of 
1.6 kHz; the AFC loop drives the frequency of the down-converted signal 
to the center of the comb filter range, allowing only a +800 Hz band to 
account for drift errors which are uncompensated by update information 
which, of course, would be unavailable during periods of severe fading. 
The maximum estimated drift rate* of approximately 1 Hz/sec implies 
that, even in the event of deep fades (which preclude feedback frequency 
updating), there would be no reacquisition delay for outages less than 
about 15 minutes. Still longer outages could allow drift accumulations 
greater than 800 Hz, with consequent delay to reacquisition. This com- 
promises data, in that the end of fade would be uncertain. Therefore 
beacon frequency prediction based upon beacon behavior observed prior 
to the fade is included in the design. This feature extended the receiver 
capability for continuous operation in extremely deep fade situations 
(i.e., no frequency update information) from 15 minutes to over 2 
hours. 


2.3 Satellite stationkeeping 


The satellite, while nominally stationary, actually moves within a 
station of +0.1° in latitude and +0.1° in longitude. As a consequence, 
the ground station antenna must either track this variation or sacrifice 
absolute gain to provide essentially equal response within the sector 
traversed by the satellite. Allowing a 1-dB maximum pattern variation 
due to stationkeeping implies a minimum 3-dB antenna beamwidth of 
about 0.3°. This corresponds to an antenna gain of about 56 dB, which 
is consistent with the minimum antenna gain requirement (> 50 dB) 
necessary to provide the required dynamic range.t 


2.4 Remote test sites 


Practical operation of remote unattended stations requires that 
equipment be conservatively designed, with broader operating margins 
than would be necessary if frequent adjustments could be made, and the 
equipment must have automatic (re-)startup features. In addition, re- 
dundant recording equipment is necessary to preclude the loss of in- 
teresting data. Finally, representative data should be remotely accessible 
to allow daily monitoring for both the health of the equipment and the 
progress of the experiment. 


* Based on COMSAT preflight test curves. 
+t Antenna selection is discussed in Section 5.1. 
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2.5 Correlation with radiometers 


A secondary objective of the experiment was to allow detailed com- 
parison of radiometer and beacon observations, hence the requirement 
to derive a radiometer signal from the same antenna as that supporting 
the beacon receivers. A 13-GHz radiometer was chosen; this provides 
a reasonable match to the dynamic measurement ranges obtained with 
the beacons and, through frequency scaling, allows calculation of ab- 
sorption at the beacon frequencies. Scattering losses may be estimated 
and combined with the measured absorption losses, scaled for frequency 
differences, and compared with the beacon losses. 


2.6 Cost 


The final restriction, obligatory to all operations, is cost limitation, 
particularly since multiple sites were to be equipped. It was this con- 
sideration that tipped the balance in favor of a fixed, limited gain an- 
tenna. This restriction also dictated that standard, readily available 
components be used in lieu of custom devices. 


ltl. DATA ACQUISITION 


The operation of numerous remote sites, both for beacon reception 
and for associated radiometer studies, places a high emphasis on data- 
remoting capabilities (see Fig. 1). It is necessary, of course, to transfer 
data between associated sites to assess diversity performance; it is equally 
necessary to transfer data back to Bell Laboratories, Holmdel, for 
monitoring purposes. The transfer between test sites is accomplished 
using a 12-bit analog-to-digital converter to drive an FSK telemetry 
system over dedicated phone lines. At the receiving end, the digital signal 
is reconverted to analog and processed in conjunction with signals re- 
ceived directly at the main site. Transmittal of summary data back to 
Holmdel is accomplished on a dial-up basis over the DDD network once 
a day. 

Data are recorded in a number of different forms: At each site a real- 
time record of the various received signal levels is kept on stripcharts. 
At the main sites, Palmetto and Grant Park, stripcharts record the si- 
multaneous levels of both the local and the diversity signals. In addition, 
at each main site there is a Portable Propagation Recorder (PPR),° which 
records the accumulated time during which the signal level is below 
various fade thresholds, as well as the number of times that a threshold 
level is traversed. This record is stored in a solid-state memory, the 
contents of which are accessible by telephone. Finally, to ensure against 
accessing problems, equipment failure, or holiday weekends, the data 
are dumped daily onto a local punched paper tape. 
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For several years prior to the beacon experiment, there existed at 
Palmetto a computer-directed data-gathering complex supporting other 
propagation experiments, including the ongoing radiometer tests. This 
complex, known as MIDAS (Multiple Input Data Acquisition System) 
uses magnetic tape to store a sampled time record of signal variations. 
Its sampling rate is five times per second on each channel. For this ex- 
periment the Palmetto MIDAS complex records the levels of not only both 
beacon signals and the on-site 13-GHz radiometer signal at Palmetto 
but also the 18-GHz radiometer signal from the remote diversity site. 
In addition it records the instantaneous beacon frequency and the local 
rain rate as sampled in a tipping bucket gauge. The magnetic tape record 
is then mailed to Holmdel on a weekly basis for processing. 


IV. RECEIVER OPERATING OBJECTIVES 
This beacon receiver was designed to the following objectives: 


(t) Acquire the signal within 15 seconds either after turn-on or after 
an extended period of signal dropout. Similarly, in the event of power 
failure, automatically reacquire the beacon signals without external 
intervention. . 

(ii) Provide accurate (+0.5 dB absolute and +0.1 dB relative) fade 
indications over at least a 30-dB dynamic range, at both 19 and 29 
GHz. 

(tit) For periods up to 1 hour, in the event of loss of signal due to 
fading, reacquire (virtually instantaneously) when the signal recovers 
to the fade depth at which it was lost. 

(tv) Track frequency variations and provide output indications ac- 
curate to approximately 1 part in 10°. 


Kach of the above objectives was not only achieved but the realized re- 
ceiver exceeded the required performance. 

The functions identified above are realized in the receiver ee in 
the simplified block diagram in Fig. 2. Each functional block is treated 
in some detail following a brief operational summary. 


V. RECEIVER REALIZATION 


Initial acquisition of the 19.04-GHz signal is accomplished by a 
scanning receiver which searches a bandwidth of 95.9 kHz for the 19-GHz 
tone and the two (polarization) switching sidebands, which are 1 kHz 
removed from the carrier. 

Following acquisition, the two receivers, one at 19 GHz and one at 29 
GHz, begin monitoring fades. Associated with the 19-GHz receiver is 
a frequency predictor which, under normal operating conditions, con- 
tinuously monitors the received beacon (19 GHz) frequency and the rate 
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Fig. 2—Simplified receiver block diagram. 


at which it is drifting. The coherent relationship between 19- and 29-GHz 
beacons allows slaving of the 29-GHz receiver frequency to the 19. In the 
event of loss of the 19-GHz signal for an extended period of time, the 
predictor extrapolates the last valid frequency measurement using the 
most recently estimated frequency derivative. This ensures that the local 
oscillators of the receivers track the beacon frequency and are in best 
condition to reacquire the signal once it reappears. Note that loss of only 
the 29-GHz signal has no impact on the frequency tracking of either 
receiver. 

In addition to the basic functions identified previously, there are 
several other functional units which are separately identified and dis- 
cussed prior to treatment of the major units. These units are: antenna 
assembly, frequency multiplier, frequency synthesizers, and comb 
filters. 


5.1 Antenna assembly 


The antenna assembly, shown diagramatically on Fig. 3, includes: 
antenna mount with azimuth and elevation adjustments, antenna, 
feedhorn, polarization adapter assembly, polarization coupler and a 
frequency diplexer. 

The functions of the antenna assembly are to: intercept sufficient 
beacon signal energy for processing, separate a 13-GHz signal for driving 
a radiometer, and separate the 19- and 28.5-GHz signals for measure- 
ment by the beacon receiver. 

Recall, from Section 2.3, that a minimum 3-dB bandwidth of 0.3° was 
necessary to meet the amplitude misalignment objective and that it was 
this beamwidth constraint that restricted the antenna aperture to 8 feet. 
The antenna selected is a CH-8 (7.5-foot aperture) conical, horn reflector 
antenna manufactured by Antennas for Communications Incorporated. 
Using a specially designed feed horn tapered to WC-65, measurements 
were made at 19.04 GHz and 26.01 GHz (the highest frequency available 
from the equipment used for tests) which indicated gains at the two 
frequencies of 51.0 dB and 53.6 dB for the vertical polarizations and 51.0 
dB and 53.4 dB for the horizontal polarization. Scaling to 28.56 GHz 
implies gains of 54.5 dB for the vertical and 54.2 dB for the horizontal 
polarizations. Since the satellite could traverse a £0.1° window, it was 
important to determine the impact of a fixed orientation on received 
amplitude. At 28.5 GHz, a 0.1° misalignment results in approximately 
1-dB gain decrement. At the time the receiver was installed, the antenna 
orientation was optimized for the two beacon frequencies and then se- 
cured permanently. ; 

The polarization adapter assembly is a rotatable framework attached 
to the feedhorn which permits continuous adjustment of the angular 
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Fig. 3—Antenna assembly and RF front end. 


relationship between the feedhorn (antenna) and the polarization cou- 
pler. This ensures that the nominal received signal polarization can be 
accounted for readily, thus placing the polarization coupler in position 
to couple maximum energy through the vertically polarized port for both 
19 and 29 GHz. 

The 13-GHz radiometer signal is obtained via a polarization coupler 
attached to the feedhorn. It is obtained from the “horizontally” polarized 
port, while the orthogonal port* provides the “vertically” polarized 
signals (19 and 28.5 GHz) which drive the beacon receiver. This coupler 
provides considerable flexibility and introduces only 0.1 dB insertion 
loss at 138 GHz, 0.25 dB at 19 GHz and 0.35 dB at 28.5 GHz. The return 
loss at each of these frequencies is greater than 30 dB. This technique 
provides the beacon receiver only one component of the 19-GHz signal; 
the horizontally polarized component is terminated. 

The final item of the antenna assembly is the diplexer. Conceptually, 
this is a waveguide 120° Y junction with a high-frequency “short” in one 
output leg and a low-frequency “short” in the second output leg. This 
readily permits separating the two beacon signals at a loss (insertion) 
penalty of 0.2 dB at 28.5 GHz and 0.3 dB at 19 GHz. 


5.2 Frequency multiplier 


The receiver utilizes 5 IF frequencies? to obtain the required gain and 
selectivity prior to detection. The first IF operates at 1.0 GHz; the first 
beat-oscillators (BO at 18.04 GHz and 27.06 GHz) are obtained from a 
frequency multiplier built by RDL to Bell Laboratories specifications. 
This supply is basically a multiplier chain (X288 and X432) which op- 
erates upon one precision reference frequency of 62.53888 MHz which 
is developed within the receiver from a 10.000-MHz reference source. 


5.3 Frequency synthesizer 


Two Rockland frequency synthesizers are used for the fourth BO, one 
in the main and one in the scanning receiver (see Section 5.5), to down- 
convert from 2 MHz to 123 kHz. In addition, the digitally controlled 
synthesizer provides the incrementally adjusted (100-Hz steps) fre- 
quency control needed for the scanning receiver; the second synthesizer 
provides the fine resolution (12.5 Hz) steps needed in the AFC loop of 
the tracking receiver. These synthesizers were selected because of two 
features which make them particularly attractive—frequency changes 
are essentially instantaneous, and the phase is continuous. This is ac- 


* The two signals are orthogonally polarized and are nominally called horizontal and 
vertical polarizations but are not actually H and V polarized at the receiving sites. 

t For the 19-GHz receiver they are: 1 GHz, 20 MHz, 2.0 MHz, 123 kHz, and 10.5 kHz. 
For the 28.5-GHz receiver the frequencies are scaled in a ratio of 3:2. 
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Fig. 4—(a) Discriminator output with synthesizer set to 1.000,000,000 MHz. (b) Dis- 
criminator output with synthesizer set to 0.999,560,000 MHz. 





complished by using the phase as the driving variable within the syn- 
thesizer. A particular signal frequency output is achieved by controlling 
the rate of change of phase: Thus when a frequency change is called for, 
the rate at which the output signal phase accumulates is either increased, 
for a higher output frequency, or decreased for a lower output frequency. 
No discontinuities occur in the phase of the output signal, and thus there 
is no need to wait for filter transients to damp in the receiver whenever 
the synthesizer output frequency is changed. Without this capability, 
each change in frequency during scanning or tracking would necessitate 
delays for filter settling. 

The synthesizer is driven from a source derived from the 10.000-MHz 
reference supplying the frequency multiplier discussed above. All BO 
frequencies in the receiver are derived from this reference, thereby 
eliminating many potential problems such as relative drift among the 
frequencies. 

Although the digital frequency control made the system attractive, 
this approach also implies a certain amount of phase noise arising from 
the A/D quantization in the synthesizer. This caused some initial dif- 
ficulties since the synthesizer output could not be used directly due to 
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incidental FM (phase noise) it contained. For example, with the output 
set for 1.000,000 MHz the phase noise appears random and spectrally 
~ smooth, but with slight changes in output frequency a structured noise 
spectrum became apparent. For example, with an output frequency of 
999.560,000 kHz, a spectral analysis revealed a number of tones (with 
a dominant noise tone at about 3.6 kHz offset), of sufficent amplitude 
to produce a 7-Hz peak-to-peak deviation (see Fig. 4). (For other 
frequencies, the noise tones would, of course, vary, but the problem is 
illustrated by this example.) To reduce this phase noise, the signal was 
first mixed up to 18.77 MHz (which maintains the same level of phase 
noise) and then divided by 10—which reduces the phase noise by 20 dB. 
This ensured that the residual incidental FM would be of no consequence 
in beacon reception. 


5.4 Comb filters 


In order to reduce the time needed to acquire the beacon signals, to 
provide a wide frequency range over which accurate signal level deter- 
mination can be made (while providing narrowband processing for noise 
limiting), parallel signal processing was incorporated. The first stage of 
that parallel processing is a set of 32 high-resolution, closely matched 
filters of 100 Hz bandwidth and spaced by 50 Hz. These comb filters span 
the range from 9.7 kHz to 11.3 kHz. The output signal from each filter 
is detected and the frequency synthesizer is incremented to continually 
drive the frequency of the highest detected signal level (assumed to be 
the beacon) to the center of this 1600-Hz band. As the beacon frequency 
drifts, the detected signal level in the center filter tends to fall as the level 
of an adjacent filter increases. Figure 5 illustrates the condition: output 
b >a >c, which implies that the signal is within the region f, — 25 Hz 
to fp. If we then note the magnitude of the difference between the output 
of b and the output of a we can further subdivide the region. If b — a > 
1 dB then the signal is located within 12 Hz of the center of filter b. 
Thus 1214 Hz resolution can be obtained; when the signal drifts further 
than 124 Hz from the comb center, the BO is readjusted to drive it back. 
Parallel processing ensures that if the beacon signal fades below a de- 
tectable level (about 38 dB fade, remaining faded for an extended peri- 
od), upon its recovery to a detectable level, if it lies anywhere in the band 
covered by the comb set, it will be detected virtually instantaneously. 
The actual detection process associated with the comb filter outputs is 
discussed in Section 5.7. 


5.5 Scanning receiver 


The scanning receiver is essentially a scanning spectrum analyzer. Its 
input is the 2.0-MHz third IF; this signal is twice down-converted to drive 
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Fig. 5—19-GHz receiver comb filter characteristic. 


two sets of comb filters. The two sets of filters have their outputs pro- 
cessed alternately, thus doubling the allowed settling time per filter while 
maintaining a rapid scan time (see Section 5.6). Each comb set consists 
of three 100-Hz filters separated by 1 kHz. The two comb sets are offset 
from each other by 50 Hz. The filter outputs are linearly detected to 
estimate power in the band and the entire frequency band of interest 
is scanned by causing the frequency synthesizer to shift the fourth Bo. 
As the filter outputs are measured, the controller portion of the receiver 
stores observations, correlating the fourth BO frequency with the power 
received. At the completion of a scan, the location of the highest detected 
value is assumed to be the location of the carrier; however, the processor 
also checks to ensure that sidebands, down 4 dB from the carrier, are 
found in frequency slots 1 kHz above and below the carrier. If the levels 
are incorrect by more than 1 dB relative to the carrier, the data is not 
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considered valid. When a valid signal is found, i.e., a carrier of sufficient 
level to be a candidate and having sidebands appropriately located and 
energized, the frequency location is stored. This frequency is compared 
with that of the main receiver and the offset, if large enough, is used for 
correction of the fourth Bo. 

Although continuous scanning is desired, it is imperative that spurious 
responses not input false information to the basic receiver. Thus, update 
is inhibited without the presence of both carrier and two sidebands; 
additionally, the absolute signal-to-noise level must be high enough to 
ensure valid data. If these conditions are satisfied and the frequency 
difference between the scanning receiver and the main receiver is 187.5 
Hz or greater, the scanning receiver output will correct the main receiver 
tuning. The intent is to ensure that the received signal is maintained as 
close as possible to the center of the comb filter set to provide maximum 
accommodation to drift. The 187.5-Hz threshold is somewhat arbitrary 
but derives from the smallest frequency increment (12.5 Hz) multiplied 
by 15—which is the maximum count of a four-stage counter. 

Under normal operating conditions the scanning receiver continues 
to operate, but its output is nonfunctional. It ts functional at initial ac- 
quisition, and because the site is remote and unmanned, it must also 
function in reacquisition should tracking be interrupted for two hours 
or more. 


5.6 Equipment design considerations 


Since reliable operation was desired even under worst case conditions, 
the maximum anticipated frequency drift range of the beacon, +2 ppm 
for aging and diurnal variation, was used in determining the maximum 
range, the design goal was to locate the signal to within 25 Hz, using the 
commodate any unforeseen variations* which might cause the beacon 
frequency to drift beyond the receiver window. In spite of this large scan 
range, the design goal was to locate the signal to within 25 Hz, using the 
scanning receiver, and to accomplish this in a maximum of 15 seconds. 
The need to make the receiver relatively inexpensive and to accomplish 
the realization in a short time frame implied that the filters would have 
to be LC rather than crystal. Selection of the final IF was based on several 
considerations. The bandwidth desired was 100 Hz to match the beacon 
specification of 90 percent of the 19-GHz energy contained within a 
100-Hz band (150 Hz for the 29-GHz signal). The achievable filter Q was 
limited by the coils used. Below about 10 kHz the coil Q fell off faster 
than frequency, thus setting 10 kHz as the lowest operating frequency. 
Higher frequency operation would be threatened by proportionately 


* This includes +1 ppm for beacon receiver aging. 
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greater temperature drift problems. Thus, the best engineering tradeoff 
between operating frequency and component availability, stability and 
tolerance suggested a final IF of about 10 kHz. 

Conceptually, a single 25-Hz filter/detector system could then be used 
to find the signal (carrier and sidebands) with the processed results being 
stored. However, this would result in a long scanning time to allow suf- 
ficient filter settling time (60-100 msec per step) and a resultant unac- 
ceptable scan time of 3-5 minutes. As an alternative, two sets of filters 
of 100-Hz bandwidth were constructed. Each set was constructed to si- 
multaneously monitor three slots separated by 1 kHz. This allows im- 
mediate recognition of the carrier and its two sidebands. The second set 
of filters is offset from the first by 50 Hz, permitting 25-Hz frequency 
accuracy and a full scan in only 15 seconds. Frequency interpolation is 
possible because the filter responses are well controlled and relative 
power levels are determined by signal frequency. As can be seen in Fig. 
6, if b > c > a, where a, b, c represent the powers in the respective fre- 
quency bands, then the beacon signal must lie between 10.5 kHz and 
10.525 kHz. The speedup in scan time (relative to the conceptual 25-Hz 
filter approach) is due to increasing filter bandwidth from 25 to 
100 Hz. 

Two critical functions in the scanning receiver are linear detection and 
sample-and-hold. The detectors, well matched and linear to within 0.1 
dB over a 60-dB range, are built around the LM318N op amp with 
458-type diodes in the feedback path. The sample-and-hold is especially 
critical since the peak detected sample must be stored for a full scan of 
15 seconds. Stable storage with time is most critical when sampling the 
spectrum in the vicinity of the carrier and its sidebands, i.e., the carrier, 
the first and third harmonics (a total of 6 kHz) sliding through a 2-kHz 
window for a total of 8 kHz. With 1/64 sec allotted per step,* this implies 
a critical storage time of about (8 kHz/100 Hz/step) (4 sec/step) = 1.25 
seconds. The critical limiting parameter in the receiver is thermal noise. 
Ensuring that all other degradations (such as signal droop in the sam- 
ple-and-hold circuit) contribute small degradation relative to noise re- 
quires that droop be limited to no more than 0.05 dB in 1.25 seconds. 
This corresponds to a maximum droop of no more than 0.6 dB over the 
15-second hold time. The current drive capabilities of the op amp, cou- 
pled with the slew rate requirements of the sample-and-hold limit the 
size of the capacitor to about 0.02 uF. For this size capacitor, a 0.05 dB 
droop in 1.25 seconds implies a maximum leakage of 1/4 nA. Choice of 
a polystyrene capacitor for storage ensured that the primary source of 
this leakage would be the reverse bias of the detector diodes. Back bias 
on most diodes would result in several nanoamps leakage—much too 


* 15 sec scan time/(960 kHz band/100 Hz steps) = Yq sec/step. 
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Fig. 6—Scanning receiver comb filter characteristics. 


large. However, by introducing circuitry (see Fig. 7) to ensure that the 
bias level across the diodes is virtually zero, i.e., less than 10 mV, the 
leakage current reduces to less than 1 pA. In a similar fashion, another 
diode associated with the hold circuit was compensated. This left, as the 
only current of consequence, the input bias current of the isolation 
amplifier of the hold circuit. Using a LH0022C, which limits maximum 
bias current to 50 pa, ensured that droop would not be a problem. 


5.7 19-GHz receiver 
The 19-GHz receiver shown in Fig. 8 tracks the beacon signal, detects 
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= CONTROL 


its level, and provides the 3/2 frequency-scaled BO for the down-con- 
version of the 29-GHz signal. 

In the interest of keeping waveguide loss to a minimum, the first 
down-conversion to 1 GHz and amplification of the beacon signal (by 
about 24 dB) takes place on the polarization adapter assembly which 
is mounted on the antenna feedhorn. See Fig. 9.* The Space Kom 
down-converter is a Schottky-diode balanced mixer which, combined 
with a low noise preamp, yields a combined noise figure of 6.5 dB (due 
primarily to the mixer). The source of the 18-GHz Bo is the frequency 
multiplier described earlier. The resultant 1-GHz first IF signal, after 
bandpass filtering to 12 MHz to limit noise and reject images, is then 
processed by the main 19-GHz receiver. As indicated in Fig. 8 the second 
down-conversion, to 20 MHz, is accomplished using a double balanced 
mixer. This mixer requires matched impedance at each port which is 
constant at 20 MHz as well as 1 GHz. This was realized by following the 
mixer with a 2-pole Butterworth (constant resistance) LPF with a 100- 
MHz Bw and then 15 dB of gain. To limit unwanted signals, a 2-pole 
Chebyshev image rejection BP filter with 46 dB of rejection is used prior 
to the next down-conversion. 

After the third down-conversion the signal is split and used to drive 
the 19-GHz receiver as well as the scanning receiver discussed earlier. 
The signal has now been amplified from a nominal —110 dBm to —53 
dBm. After the fourth conversion to 123 kHz, a 4.0-kHz BW, BP filter 
(Chebyshev, 0.01 dB ripple) is used to further restrict the bandwidth. 
Note that the source of the fourth BO used in this down-conversion is 


* Note that the RF front end and temperature-controlled reference noise source for the 
13-GHz radiometer are also mounted on the adapter assembly. 
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Fig. 9—Polarization adapter assembly. 


the digitally controlled frequency synthesizer, which is tuned in 12.5-Hz 
increments to continually center the received signal at 123 kHz. After 
down-conversion to 10 kHz the output is passed through a 25-kHz one- 
pole active Chebyshev low-pass filter. At this point the signal level has 
been boosted to 14V P-P, and it is this signal which drives the comb filter 
detectors. 

The five down-conversion steps were made along the way to allow 
progressive tightening of the noise bandwidth without requiring exces- 
sively high Q. By gradually narrowing the noise band, the filters can be 
designed with readily achievable Q using standard L/C technology, thus 
precluding the need for costly, specially designed high-Q crystal fil- 
ters. 

The actual detection process is illustrated in Fig. 10. The fifth IF signal 
is simultaneously fed to a set of 32 filters of 100-Hz bandwidth on 50-Hz 
centers spanning a frequency range from 9.7 kHz to 11.3 kHz. Each filter 
is a 2-pole Chebyshev with a 0.1-dB bandwidth of 50 Hz. Because of the 
well-controlled filter shape, it is possible to make an accurate determi- 
nation of signal location from the relative levels in several filters. Each 
detector output (one detector/filter) is post-detection filtered to 1-Hz 
bandwidth to further reduce noise. Samples of the post-detection filter 
outputs are multiplexed together and scanned by a peak detector. A 
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Fig. 13—Fade distribution in Palmetto, Georgia. 


sample-and-hold following the peak detector always acquires the most 
recent sample of the received signal and is updated at each scan of the 
comb filter outputs. Also, by noting the location of the maximum signal 
within the comb set, together with the setting of the synthesizer, the 
beacon frequency is determined. Note that the comb filters have a 
well-controlled 50-Hz flat bandpass spaced on 50-Hz centers with the 
center frequency controlled to +1 Hz, a 75-Hz 1-dB bandwidth, and a 
100-Hz 3-dB bandwidth. As discussed in the section on comb filters, it 
is possible to get 1214-Hz resolution of the beacon frequency and thus 
to increment the frequency synthesizer so as to continually drive the 
beacon signal to the center comb position. 


5.8 28.5-GHz receiver 


Since the 28.5-GHz transmitter is scaled in frequency by 3/2 from the 
19-GHz transmitter, the 28.5-GHz receiver local oscillator frequencies 
are also derived directly from those of the 19-GHz receiver. In this way 
a single AFC loop is sufficient to center both received signals in their 
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respective filters. Each step in the 28.5-GHz receiver processing is 
analogous with the 19-GHz receiver except that all frequencies are scaled 
up. 

The signal is down-converted in five steps, using double balanced 
mixers and filtering to reject images generated in the process. The final 
processing is done at 12 kHz with a 150-Hz filter. This output is passed 
through a linear detector followed by a 14-Hz LPF to average out the 
noise. Note that since the 19-GHz signal will undergo less severe fading 

than the 28.5-GHz signal, the 19-GHz signal will always recover first and 
thus provide valid frequency information for detecting the 29-GHz signal 
before the 29-GHz signal actually recovers. 


5.9 Frequency predictor 


As indicated earlier, there will be occasional periods during which the 
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beacon signal will be attenuated below the level of reliable reception. If 
these periods are relatively long, say 30 minutes or more, the frequency 
drift of the beacon may be sufficient to remove it from the range of the — 
receiver. T’o avoid this potential problem, a frequency drift predictor 
was incorporated in the receiver. Noting the frequency of the beacon, 
as indicated by the settings of the synthesizer, and the location of the 
peak signal in the comb filter set, storing these indications each scan and 
averaging over a long time period (512 seconds), the average rate of 
change of frequency can be determined. This drift rate is continually 
updated as long as the received signal is strong enough for reliable 
tracking. Whenever the signal level falls below that level, presumably 
due to heavy rainfall, the drift rate update is stopped. Instead, the most 
recently calculated drift rate is used to increment the synthesizer and 
drive the local oscillators to the frequency anticipated by frequency 
extrapolation. This technique had been found to increase the tracking 
receiver’s capability for immediate acquisition, extending that capability 
from 15 minutes to over 2 hours. 


Vi. EARLY RESULTS 


Detailed reduction of the data is, of course, ongoing; there are, how- 
ever, several items of interest already gleaned. 

Immediately following installation of the beacon receiver in Palmetto, 
a very low level, approximately 0.25-dB peak-to-peak, periodic amplitude 
variation was detected in both the 19- and 28.5-GHz received signals; 
see Fig. 11. The frequency of this oscillation, ~54 times per minute, was 
found to coincide with a 0.04-degree precessing of the satellite arising 
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Fig. 16—Diversity results, Georgia. 


from spin stabilization. This phenomenon was not detected at the other 
beacon reception sites in Grant Park or Crawford Hill due to the fact that 
Palmetto was closer to the edge of the satellite antenna pattern and thus 
experienced a greater amplitude change as the satellite pattern 
scanned. 

The long-term attenuation (time faded below) of the beacons and of 
the 13-GHz radiometer, at both Grant Park and Palmetto, are shown 
in Figs. 12 and 13 respectively. Very good agreement among the three 
observations was demonstrated by selecting a particular level of occur- 
rence, say 0.1 percent, and determining the attenuation ratio between 
two of the curves, e.g., the 28.5 GHz and the 138 GHz. From that ratio an 
equivalent rain rate can be deduced from Setzer’s® work (attenuation 
ratios as a function. of rain rate were derived from Setzer’s results and 
are summarized in Fig. 14) and an estimate of the attenuation ratio be- 
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tween (say) the 19 GHz and 13 GHz can be obtained for the same rain 
rate. The X’s in Fig. 13 correspond to 19-GHz attenuation deduced in 
the above manner. Similar results hold if the 13-GHz and 19-GHz signals 
are used to extrapolate to the 29-GHz attenuation curve. 

The preliminary diversity results for Grant Park and Palmetto are 
given in Figs. 15 and 16. Note that the Grant Park results are biased by 
the fact that the only appreciable rainfall in the data base occurred 
during a brief period during the summer of 1976. During this one period 
of rain the site diversity improvement factor (the ratio of single site time 
faded below to diversity time faded below) was approximately 12, con- 
siderably less than the improvement factors of 50 we have seen in the 
past for equivalent fade levels. 
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COMSTAR Experiment: 


Notes on the COMSTAR Beacon Experiment 


By E. E. MULLER 


(Manuscript received December 5, 1977) 


Definitive empirical characterization of the transmission properties 
of the atmosphere has long been limited by the lack of appropriate 
sources radiating from beyond the atmosphere. The COMSTAR beacons 
provide appropriate radiation to interested experimentors throughout 
the continental United States. 


Government, commercial, and scientific interest in transmission 
through the atmosphere at frequencies above 10 GHz derives from the 
potential for employing this portion of the spectrum for satellite com- 
munications and is basic to the attention being paid the COMSTAR 
beacon experiment, both within the U.S. and abroad. This opportunity 
for improving future generation communication satellites underlies 
_A.T.&T.’s provision for carefully designed millimeter wave beacon 
sources in several earlier corporate proposals to the FCC. 

Soon after the FCC granted permission to proceed with COMSTAR, 
specifications describing the beacon characteristics were published in 
technical journals along with an invitation to build and operate equip- 
ment for their reception. This announcement was received enthusiast- 
ically, and on October 1, 1975, a group of 40 interested experimenters 
gathered at a first “COMSTAR Experimenter’s” meeting at Holmdel, New 
Jersey. Spacecraft development progress was discussed, along with early 
characterization results from prototype beacon equipments. Attendees 
signified their interest in participating in the experiment and outlined 
tentative plans. It was agreed that coexperimenters would meet from 
time to time, and that the data resulting from COMSTAR observations 
would be published in the open literature. 

COMSTAR D1 has been in service now for almost two years. Beacon 
radiations have been observed at Bell Laboratories sites for this entire 
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period, beginning even before the satellite D1 came on station. Aspects 
of COMSTAR’s behavior have been reported at professional meetings and 
in journals. Participation in the experiment is gratifying; at the present 
time observations are in progress at: 

University of South Florida, Tampa, Fla. 

Virginia Polytechnic Institute, Blacksburg, Va. 

Johns Hopkins University, Laurel, Md. 

Air Force Cambridge Research Labs, Hanson Field, Mass. 

Institute for Telecommunications Science, Boulder, Colo. 

COMSAT Laboratories, Clarksburg, Md. 

General Telephone and Telegraph, Waltham, Mass. 

In addition, several universities and government agencies have pro- 
grams directed toward equipping facilities receiving the beacons at other 
locations. 

The service lifetime of these beacons has three determinants: the 
lifetimes of critical devices, particularly the IMPATT amplifiers; the re- 
dundancy and fail-safe features provided in the system design; and the 
power budget of the spacecraft itself. These units have been designed 
to provide a minimum of two years of useful operation, as such a period 
is consistent with stable estimates for attenuation behavior. This goal 
would appear to have been realized in spacecrafts D1 and D2, although 
the 19-GHz radiation from satellite D1 has a predictable anomalous 
behavior during portions of the satellite thermal cycle. Long-term beacon 
availability depends, therefore, on continued access to surplus spacecraft 
power—a commodity which decreases with increased communications 
load and decreased solar cell efficiency. Given the staggered launch 
schedule of the COMSAT vehicles, the pertinent design parameters, and 
our present experience, it is expected that COMSTAR beacon signals 
should be available at least until the early 1980s. 
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Prolate Spheroidal Wave Functions, Fourier 
Analysis, and Uncertainty— 
V: The Discrete Case 


By D. SLEPIAN 
(Manuscript received July 20, 1977) 


A discrete time series has associated with it an amplitude spectrum 
which ts a periodic function of frequency. This paper investigates the 
extent to which a time series can be concentrated ona finite index set 
and also have its spectrum concentrated on a subinterval of the fun- 
damental period of the spectrum. Key to the analysis are certain se- 
quences, called discrete prolate spheroidal sequences, and certain 
functions of frequency called discrete prolate spheroidal functions. 
Their mathematical properties are investigated in great detail, and 
many applications to signal analysis are pointed out. 


1. INTRODUCTION 


In many branches of technology, such as sampled-data theory, time- 
series analysis, etc., doubly infinite sequences of complex numbers, {h,} 


=...,h_4, ho, hi, ... play an important role. Associated with such a 
sequence is its amplitude spectrum 
A(f) =X hpe?nrl. (1) 


In this paper we attempt to elucidate certain features of the complex 
relationship between {h,} and its amplitude spectrum H(f). 

Of prime importance in the analysis we present are some special se- 
quences, here called discrete prolate spheroidal sequences (DPSS’s), and 
some related special functions called discrete prolate spheroidal wave 
functions (DPSWF’s). Much of the paper is devoted to a study of their 
mathematical properties. They are fundamental tools for understanding 
the extent to which sequences and their spectra can be simultaneously 
concentrated: they have many potential applications in communications 
technology. 


1371 


We motivate our work by discussing a simple problem. But first some 
notation is needed. We adopt the abbreviation 


E(nyno)= ¥ |hal2 (2) 


n=nt 
and refer to this quantity as the energy of the sequence {h,} in the index 
range (n1,nN2). Throughout this paper we restrict our attention to se- 
quences whose total energy E = E(—~@,@) is finite. Associated with a 
sequence is its amplitude spectrum defined in (1). It is periodic in f with 
period 1 and we shall generally consider it only for |f| < 4%. From the 
theory of Fourier series, we then have the representation 


1/2 : 
ha = ff. (Hie tinldf, n= 0,41)... (3) 


for the sequence, and from Parseval’s theorem we have that 


ro ) 1/2 
B= |hql? = te |H(f) |2df. (4) 


If H(f) is given, we say that the sequence {h,,} defined by (8) is the se- 
quence belonging to H and we write {h,,} <> H(f). 

Now let W be a positive number less than 4. If the amplitude spec- 
trum of {h,} vanishes for W < |f| < 1%, we say that the sequence is band- 
limited and that it has bandwidth W. The elements of a bandlimited 
sequence can be written in the form 


Ww 
h, = f yy ife tnd, 0<W< ; , n=0,+1,... (5) 


Analogously, given two finite integers, n; < ne, we shall say that a se- 
quence {h,,} is indexlimited to the index interval (n1,n2) if h, vanishes 
whenever n > ng orn < nj. It is not hard to see that, except for the trivial 
all-zero sequence, a bandlimited sequence cannot be indexlimited and 
that an indexlimited sequence cannot be bandlimited. 

It is natural now to ask just how nearly indexlimited a bandlimited 
sequence can be. Specifically, we seek the maximum value of the con- 
centration 


d a — = (Erol?) /(X irl?) 6) 


for all sequences of bandwidth W, and ask for which bandlimited se- 
quences the concentration attains this maximal value. The answers to 
these questions are simply stated in terms of the discrete prolate sphe- 
roidal wave functions U;(N,W;f), the discrete prolate spheroidal se- 
quences {u*)(N, W)}, and their associated eigenvalues \,(N,W), k = 0,1,2, 
...,N—1. The bandlimited sequence {h,,} of bandwidth W most con- 
centrated in the sense of (6) is proportional to the DPSS {v® ,,(N,W)}, 


Ul 
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its amplitude spectrum is proportional to e!*(2Not+N-DIU (N, W;f) in 
the interval |f| < W, and its concentration is given by Ao(N, W). 

In earlier papers in this series!~* we treated the analogous problem 
of the maximal time-concentration of a continuous signal f(t) of limited 
bandwidth. The optimal signals in that case, prolate spheroidal wave 
functions (PSWF’s), were found to have many interesting and useful 
properties that were explored in related papers.®> 8 We here borrow freely 
from the techniques used in these earlier works and extend many of those 
results to the present case of discrete time series. Details of derivations 
that parallel closely ones to be found in Refs. 1-8 are sometimes omit- 
ted. 

Part of the material presented here has been anticipated by others. 
As early as 1964 C. L. Mallows in an unpublished work defined versions 
of the DPSWF’s and DPSS’s. He showed that the former satisfy a sec- 
ond-order differential equation and that the latter satisfy a second-order 
difference equation, and described a number of their other properties 
as well. Tufts and Francis!¢ in 1970 showed the importance of the DPSS’s 
in the optimal design of digital filters. Independently, Papoulis and 
Bertran!? in 1972 made a similar application. Eberhard?” in 1973 showed 
that the DPSS provide optimal design of a discrete window for the cal- 
culation of power spectra under a natural criterion. All of these later 
authors present some numerical values of the functions and of \o(N, W) 
for a few isolated values of N and W. None of them treat the subject as 
intensively as is done here. See Ref. 18 for some comments on these ap- 
plications. An interesting application in optics to the theory of image 
formation was made by Gori and Guattari?3 in 1974. 

Regarding the organization of this paper, in Section II we state without 
proof some of the more useful and interesting properties of the DPSWF’s 
and the DPSs’s. Included are curves and asymptotic formulae. In Section 
III we discuss some extremal properties and some applications of the 
functions. Of particular interest, perhaps, is the prediction problem of 
Section 3.2. In Section IV, proofs are given or outlined for the less obvious 
statements found in Sections II and III. 


ll. THE DPSWF’s, THE DPSS’s, AND SOME OF THEIR PROPERTIES 
Throughout the remainder of this paper, unless otherwise explicitly 

stated, N is a positive integer and W a positive real number less than 

Yo. 

2.1 The discrete prolate spheroidal wave functions 


Since its kernel is degenerate, the integral equation 


W SINE SED Seay =)V(f), -—2“<f< (7) 


-w sinz(f —f’) 
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has only N non-zero eigenvalues, Ao(N, W), A1~N,W),..., An—-1(N, W). 
They are distinct, real and positive and we order them so that 
Ao(N,W) > A(N,W) >... > An-1(N,W) > 0. (8) 


There are N linearly independent real eigenfunctions of (7) associated 
with these eigenvalues and we denote them by Uo(N,W;/f), U1(N,W5f), 
..., Un—-1(N, W;f). When these are normalized so that 


1/2 
f |Un(N,W3f)|2df = 1, 
—1/2 
dU, (N,W;0) > 0, 
df 
k=0,1,...,N—-1, 


U;.(N,W;0) = 0, (9) 


they are the DPSWF’s. Thus, the discrete prolate spheroidal wave 
functions U;(N,W;f) and their associated eigenvalues \;(N,W;f) are 
defined by 
W sin Nr(f — f’) 
-w sin z(f —f’) 
—o<f<o, kR=0,1,...,N—1, (10) 


along with (8) and (9) and the requirement that the U;, be real. 
The DPSWF’s are doubly orthogonal: 


Un (N, Wf )df’ = An (N,W) URN, Wf) 


f . U;(N,Wsf)U;(N, Wf)df 


= W/2 N. Pe N.W: d — 
=); f. (UN WiNUMN,Wifaf = ddiy (11) 


14j=0,1,...,N—-1. 


Fork =0,1,...,N —1, the function U; (N, W;/f) is periodic in f. It has 
period 1 if N is odd and period 2 if N is even. In either case we have 


Ux (N, Wf + 1) = (-I)N71U, (N, Wf), (12) 


while 


Us (N,W;5—f) = K(N,)Un-1-% (N,5— Wil) 


Me (WN, ; — W) = 1— dwa1-2 (NW) (13) 
(—1)N-D2Atk N odd 
K(N,R) = 
NE) epee N even. 
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Fig. 1—U, (4,0.4;f) for k = 0,1,2,3 and U; (5,0.4;f) for k = 0,1,2,3,4 for 0 < f < 0.5. 


The DPSWF U; (N,W;/f) has exactly k zeros in the open interval —W < 
f < W and exactly N — 1 zeros in —'4 < f < 4. It is an even or odd 
function of f according to the parity of k. Plots of some selected DPSWF’s 
are given in Figures 1 and 2. Note the inserts with changed scales needed 
to show detail of Up for 0.4 < f < 0.5 in Fig. 1 and for U4(5,0.2;f) in 0 < 
f < 0.1 in Fig. 2. Values of some A; (N, W) can be obtained from the or- 
dinates of the curves of Figures 3, 4, 5 and 6 corresponding to integer 
abscissa values. Figure 7 shows the dependence of some dA, (N,W) 
on W. 
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Let o(N, W) denote the N X N tri-diagonal matrix whose element in 
the ith row and jth column is 


ae 
iw i), “7-1 
5 i) j=l 
N-1 2 . 
— ee 2xW, j=l 
Mise ( ; i) cos2xW, ji 
il 33 : Be 
gt IN -1-9), j=itl 
0, 7 Sal Sa, 
Lj =0,1,...,N-—1. 


(14) 


The WN eigenvalues of this matrix are real and distinct. We denote them 
by 


O0(N,W) > 6,(N,W) >... > On-1(N,W). (15) 
Then the DPSWF’s satisfy the differential equation 


dU; (N, Wf) 


= [cos w — A] a 


+ | F0v2=1) cose H4(N,W) | TNWH) =0 (26) 


where we write 


w=2rf, A=cos2xrW. (17) 


2.2 The discrete prolate spheroidal sequences 


For each k = 0,1,2,..., N — 1, the DPss {uv (\N, W)} is defined as the 
real solution to the system of equations* 


N=! sin 2xW(n — m) 


vi (N,W) = AR(N,W)of?(N,W), (18) 
m=0 a(n — m) 


n=0,+1,+2,... 


normalized so that 


N-1 
x vi) (N,W)2 = 1, (19) 
fe 


N-1 N-1 
~ vf(N,W) 20, © (N-1— 2j)v(N,W) = 0. (20) 
0 0 


* Tt is understood here, of course, that when n = m the expression [sin 22W(n — m)|/x(n 
— m) has the value 2W. 
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Fig. 2—U;, (4,0.2;f) for k = 0,1,2,3 and Uz (5,0.2;f) for k = 0,1,2,3,4 for 0 < f < 0.5. 


The Az (N,W) here are, as before, the ordered non-zero eigenvalues of 
the integral equation (7). These quantities are thus seen to be also the 
eigenvalues of the N X N matrix p(N,W) with elements 


_ sin 27W(m — n) 
PAN,W) mn = ae 


and the (N — 1)-vector obtained by indexlimiting the Dpss {v{*)(N, W)} 
to the index set (0,N — 1) is an eigenvector of p(N,W). 


, mn=0,1,...,N—-1, (21) 
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Fig. 3—Values of \;,(N,0.4) fork =0,...,15 and N =0,1,..., 20. 


The DPSs’s are doubly orthogonal: 
N-1 : eo . ; 
D oP (N,W)o)(N,W) = Ai © vP(N,W)voY(N,W) = 4; (22) 
n=0 —o 


ij =0,1,...,N—1. 
They obey the symmetry laws 


v)(N,W) = (-1)F vf,_,(N,W) (23) 
v®)(N,W) = (-1)Fu f51;") (N, 12 - W), (24) 
n =0,+1,+2,... 


B= 0A iN SL, 
1.0 


0.8 


dc (NW) 
o 
fop) 


9 
b> 


0.2 / ASYMPTOTIC 


FORMULA 





Fig. 4— Values of \;,(n,0.2) fork = 0,...,9and N =0,1,... 20. 
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Fig. 5—1 — A, (N,0.4) fork = 0,...,7 and N = 1,2,..., 13. 


The DPSS’s indexlimited to (0,N — 1) satisfy the difference equa- 
tion 


1 
5 n(N — n)v®,(N,W) 


= 2 
+ | cos 2nW orm —- n) - o4(N,W) | v)(N,W) 


1 
Ss (n + 1)[N -—1—n]v®(N,W) =0, (25) 
k,n =0,1,...,N—-1. 


Here the 6’s are as in the differential equation (16). 
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Fig. 6—1 — dz (N,0.2) fork =0,...,5 and N = 1,2,...,13. 


2.3. Connections between the DPSWF’s and the DPSS’s 


We have 
N-1 
U2 (N, Wf) = ee X vl (N,Wh)e-it(N-1-2n)F (26) 
n=0 
k=0,1,...,N-—1, 
where 


Be (* k even (27) 


i, k odd. 
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Fig. 7—Az(N, W) = dz (N) vs. W for several values of k and N. 


Conversely, 
1 1/2 : 
v®(N,W) =— f. |, Un(N, Wif)eit\N-1-20)faf (28) 
€r J-1 ; . 


nk =0,1,...,N—1. 
But, one also has 
p(t) N, W fy, U N, W: ix(N—1—2n)fq 29 
( )= on W) rf fe f (29) 
k=0,1,...,N—-1 
for all values of n. 


It is convenient now to introduce the bandlimiting operator By de- 
fined by 


Hif), |fl<W 


a ea (30) 


BwH(f) = 


and the indexlimiting operator IX? defined by 
IN? {hn} = {gn} 
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where 
hn, Nyxn<Noe 


= (31 
oe 0, otherwise. ey) 


In terms of these operators, (28) and (29) can be stated 


ex! fun? (N, W)} > Up (N, Wife N—DF (32) 
edn (N,W)tvn?(N, W)} <> BwUR(N, Ws fei NDF, (33) 


For the sequence {u“")(N, W)} belonging to the DPSWF U;,(N,W;f) we 
have . 


1/2 
u®)(N,W) = f |, Us(N, Wsfle?*i"/df 
-1 








’ N-1 : 
sin 7 ( +n =9) 
N-1 
= & —_—_—_——— v")(N,W) 

j=0 N-1 : 

wT ( +n - i) 

2 
n = 0,£1,+2,... (34) 


When N = 2M + 1 is odd, this reduces simply to 
)(2M+1,W), |n| <M 

(2M + 1,W) = eae bie 
un ( ) 0, |n| > M, 


a multiple of the indexlimited shifted DPSS. Equation (29) shows that 
conversely the spectrum of the shifted DPSS is in this case a multiple of 
the bandlimited DPSWF, 


ep ARn(2M + 1;W){v (2M + 1,W)} <> BwU;,(2M + 1,W5f). (36) 


(35) 


2.4. Asymptotics of DPSWF’s 
In what follows, in addition to (17) we adopt the abbreviation 
. a=1-—-A=1-—cos2rw. (37) 


A. U;(N,W;f) for fixed k and large N 
When N is large and W and F& are fixed, 


‘cifi(), 0O<w<N-1% 
Cofe(w), N-13 < w S arccos [A + N-3/7] 

Un (N,Wf) ~ ¥ cafs(@), arccos [A + N-32] <w<2xW (38) 
caf4(w), 2aW <w K arccos [A — N-3/2| 


C5fs5(w), arccos [A — N-3/2] <w <a. 
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Here 


fi(w) = Dz (=) ne) 


[V1 + cos w + Vcos w — Al] 


fale) = [(1 — cos w)(cos w — A)]}/4 
x | V1—cosw | 
Va(1 + cos w) + V 2(cos w — A) 
N 
f3(w) = Io (Z= Vos oA) (39) 


OE (== VA= 005) 


fs(w) 


N 1 1 x 3a 
— arcsin 6(w) + = h +=) +(k—N)* =| 
cos | asin (a) +5 (k +5) atesin os) + (h—N) T+ S| 


[(A — cos w)(1 — cos w)]!/4 


~at2cosw _ (2 = 3a) — (2 + a) cos 
Hw) =— >, o(w) = (2 — a)(1 — cos w) 


where D;(-) is the Weber function (Ref. 9, Vol. II, Chapter VID, and 
Io and Jo are the usual Bessel functions. The constants in (38) are given 
by 


cp = (-1IFAI(RN)-1/2 4 M12 Yau Y3N Ya[/2 a Val Y5[2 — a] Ye 


l Yy Yo Y3 Y4 Y5 Y6 
1 1 1 

1 — 1 _— - 0 0 
i ig 3 A 
4 A 8 4 8 2 4 

3 oa La gee —N (N -—k—1)/2 
4 A 8 4 8 2 4 

eo Rages. iN +4 (N-k-1)/2. 
4 4 8 4 8 2 4 4 


B. U;(N,W;/f) for large N and k = |2WN(1 — «)] 
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When k and N become large together with 
k=|2WN(1-9)], 0<e<1 (41) 
and « fixed, then 
U;(N, Wf) 
digi(w), O<w <arccos (B+ N7!/2) 
dogo(w), arccos (B + N~1/2) < w < arccos (B — N-1/2) 
7 d3g3(w), arccos (B — N~1/2) < w < arccos (A + N-}) ‘as 
d4g4(w), arccos(A + N-!) <w <21W 
d4gs(w), 2rW <w < arccos (A — N-}) 
dg£e6(w), arccos (A — N71!) <w<d. 


Here B is determined so that 


S, Vee A) — era ey 
and 
ns moll LVS 

1S Se 


N w and 
Ria Ree Ee f Vet 





cos t — 
_o ¢° ‘a 
4 arccos B V (B — cos t)(cos t- A) 
gale) = Io (N Ve eo eee A) (44) 
B-A 

&5(w) = Jo (vA = (A — cos )) 

B- t 
&6(w) = R(w) cos E f- iy Bat saat 


Tc (ge een DReereerine | 
4 Jo V(B-—cost)(A — cost) 
R(w) = |(B — cos w)(A — cos w)|—™. 
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The function Ai(x) is the Airy function defined in Ref. 10, page 446. The 
parameter C here is given by 


_4[N —1)k) = 
iad Peace | (45) 


rem 27 


where [x]rem 27 = X — 2a|x/2a] is the number between zero and 27 
congruent to x modulo 27, and the parameter 6 in g¢ is 


ax WN C 
Yen eles precy 46 
F ee |. 20) 


The L’s are given by 


i= "Pde La = f. * Q(@)dé 
Ls= f Pde = La= f. ” @@dé (47) 


A A 
Ls= f P@dt Le= f Q@)dE=Ls 
eee oe 
(& — A)(1 — &) 
The L’s can be expressed simply in terms of complete elliptic integrals 
of the first and third kind. (See Ref. 13, pages 242 and 265.) (The inte- 
grals in (44) can also be expressed in terms of elliptic functions, but the 


resulting expressions shed no light on the nature of the solution.) Finally, 
the d’s in (42) are 


d, = Lb? qi/2Q 3/2 

do = Ly? 721/3(1 — B2)-1/12(B — A)-V/3N 1/6 
dg = Ly! q/2Q-1/2 

dg = Ly? x(1 — A2)~'/4e~CLal4e -NL3/2N 1/2 


dg = i qi/29 1/29 —CL 4/49 —NL3/2 (48) 


/ 
P(g) = ” Q(t) = |(E- BYE — ANA — 2)|-¥2. 








C. U;(N,W;f) for large N andk =|2WN + (b/z) log NJ 
When N — - and 


k =|2WN + (b/r) log N| (49) 
with b and W fixed, we have asymptotically in N 
€1hz(), 0 < w Sarccos [A + N-24] 
Uz (N,W3f) ~ { echo(w), Jcos w — A] < N-2/3 (50) 
esh3(w), arccos [A — N-2/3] <w < x. 
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Here 


611+ A)tan=+1 


1 N Eg 2 T 
= — —— | —k- 
hy(w) a cos 9 wt log 9 


BQ + A) tan 5-1 


ho(w) = ei(8/2)N(cos w— A) (5 -i =. 1; -i8N(cos w — A)) (51) 
he(w) = ——— 
= 
: A — cos w 
@ 
1+ A)tan-—+1 
N Eg B( ) tan : 
X cos gets lee SSS aS 1)5 
B(1 + A) tan ~ 1 
B = |csc 2x W| (52) 
and 


ax a(a+ 1) x 
el! c(c+1) 2! 7° 
is the confluent hypergeometric function in the notation of Ref. 9, Vol. 


1, Chapter 6. The constant E is to be determined as the root of smallest 
absolute value of 


@(a,c;3x) =1+ 


Eg 2N v OT 
N + — log — + Y(EB) —-k-~-—-—=0. 53 
aW 9 108 B Y(EB) 5 4 (53) 
Here we have written 
Tr (5 _ is) = r(s)es) (54) 


where r, y, and s are real and I is the usual gamma function. 
The constants in (50) are given by 


Sar 1/2 
= (—7)LR/2) | —_—<<—<_—_—_—_______ 
aa Fe + e7£8] log x 
eo = r(EB)VBNe/4EBe, (55) 


D. U;(N,W;f) for large N and k = |2WN(1 + 6] 
The case of large N with k = |2WN(1 + €)],0 <e<1/2W —1 canbe 
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reduced to case B above by means of the formula (13). One finds 


1 
Ue (N,Wsf) = KWNA)Uw (NW; —f) (56) 
where U;:(N,W’;¥2 — f) can be obtained from (41)-(48). Here 
W = Pe W 
2 
k’=N-1-k~2W’N(1- ©) (57) 


E. Uy-—e(N,W;f) for fixed @ and large N 
Formula (13) reduces this case to case A above: 


1 
Un-e(N,Wsf) = KINN ~ £)Ue- (N, ; -W;5-f) 


where formulas (38)—(40) can be used to obtain asymptotic values for 
Ue-i(N 2 — W32 — f). 
2.5 Asymptotics of the eigenvalues ),,( N,W) 

For fixed k and large N, one has 
1 — Ag (N,W) ~ w1/2(RI)-1QC4R +A gy 2k+V/ATO — q]—(k+1/2)NR+1/29 —yN 

2a 
V2—-Va 
Some values computed from this expression are shown as dotted lines 
on Figs. 5 and 6. The fit with Xo is very good for N = 2 when W = 0.4 and 
for N = 6 when W = 0.2. 

For large N and k with 

k=|2WN(1 - 9], 0<e<1, 
1 — A, (N,W) ~ e~Ch4/2e-LaN (59) 

Here the L’s are given by (47) with B and C determined from (43) and 


(45). 
For large N and k with 


k =|2WN + (b/r) log NJ 


a =1-—cos2rW, y = log | 1+ (58) 





Ap (N, W) ~ . 60 

e(N,W) ~——— (60) 

A good approximation to A; (N,W) when 0.2 < A < 0.8 is given by 
Ne(N,W) ~ [1 + 74)“ (61) 
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where 


_ _1 
i 2a |NW_ k/2 — 4) (62) 
log [8N| sin 2xW]] + y 
where y = 0.5772156649 is the Euler-Mascheroni constant. Some values 
of (61)—(62) are shown on Fig. 3 for k = 7 and 13 and on Fig. 4 fork = 3 
and 6. Near \ = 4 the discrepancy between the true value and the for- 
mula (61)—(62) cannot be seen on the scale of Figs. 3 and 4. 
Asymptotic values for \;(N,W) with N large and N —k = @ fixed can 
be obtained directly from (13) and (58). In a similar way, (13) and (59) 
provide an asymptotic formula for \,(N,W) when k = |2WN(1 + «)], 
0 <«<1/2W — 1. One has in this case 


AR (N,W) ~ e~Cha2e-LaN (63) 
where C, Ls, and L4 are to be computed from (48), (45) and (47) with W 


replaced by 44 — W and k replaced by N —k — 1. 
For fixed k and N, but W small, we find 


\.(N,W) = - (27 W)2k+1G(k,N)[1 + 0(W)] (64) 


where, for example 


G(0,N) =N 


G(LN) = ag iN _~yN(N +1 


G(2,N) = —~ (N — 2)(N — 1)N(N + DIN + 2) 


8100 
92N—2 
G(N — 1,N) = aN : (65) 
— 23 
oN -1 ( ) 
( ) N-1 
The general term is 
22k (k1)6 k ; 
Gk,N) = EY __ Fh ow-j. (66) 


(2k + 1)?[(2k)!]* j=—n 


For fixed k and N, but W near '/, i.e.,44 — W > 0 small, (13) combined 
with (64) gives 


1 — Az (N,W) 


=-+@(N-1-k,N)[r0 — 2W))2--[1 + 001 — 2W)].. (67) 
v 
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2.6 Relationship to PSWF’s: W—> 0, N— ©, rNW—c> 0 
The prolate spheriodal wave functions (PSWF’s) y;(c;x) and their 


associated eigenvalues A;(c), 1 = 0,1,2,... are defined by 
1 sin c(x — x’) 
fe Mile de’ = LOC), (68) 
-1 a(x —x’) 
—~x<x<aw 
No >A, > dro... 
J Wlex)dx = 1, ¥4(0) 20, ¥i(0) 20 (69) 
i =0,1,2,.... 


For each i = 0,1,2,... the PSWF y;(c;x) satisfies the differential equa- 
tion 
ad (1 — x2) dy; 
dx dx 


for a special value 





+ [x —c*x?]y; = 0 (70) 


X = xile) (71) 


of the parameter x. The PSWF’s and the quantities \;(c) and x;(c) are 
discussed in detail in Refs. 1-6. 
Now let c > 0 and y, a real number, be given. If, as 


w-0,N=|——| andn =| +9) (72) 
then 
\i(N,W) ~ Aj(c) 
VW U.N, W; WY) ~ vi (c.f) (73) 
‘N (i) tl divs 
9 Un (N,W) Vite ie) (74) 
N2 — 1 — 20;(N,W) ~ xi(c). (75) 


In (74) when i is even the plus sign is taken when f1,W;(c;x)dx > 0; if 
i is odd, the plus sign is taken when f1,x¥;(c,x)dx > 0; otherwise, the 
negative sign is to be used. 
ill. APPLICATIONS 
3.1 Extremal properties 

3.1.1 Most concentrated bandlimited sequence 


To maximize (6) over the bandlimited sequences, we replace h, 
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there by its representation (5) and use (4) to obtain 
Ww W ~ . 
f df f df’H(f)H (f’)e~27in¢-1) 


NotN-1 


n=N 
= o 


Ww 
ff, HHO Pat 
-w 


W W ee ey nena) SNA 79) yt 


Ww 
{_, HHOPar 
-W 


poe sin Nr(f — f’) — 
as af fa oe Map 


(76) 
i Vp leaf 
-w 
Here we have written 
V(f) = eit 2NotN-DIH (f) (77) 
and used the fact that 
ae e-2nin({-f’) = e-in(2NotN-V(f-f’) sin Na(f — f') 
No sin x(f — f’) 


A simple variational argument applied to (76) shows that A is stationary 
when y satisfies (7) and hence the maximum value of J is Ao(N, W), at- 
tained when y¥(f) = cUo(N,W;f), |f| < W. Equation (77) then shows that 
the most concentration bandlimited sequence is 


ceit2Not+N-DF Ud(N,Wf), |f| < W 
hn} <> H( \o (78) 
a aa) W<Ifl <%, 
For the sequence itself, we then find from (5) 
Ww 
hn =¢ ff. ss Ud(N, W;f)eizIN-1-22—-No)fldf 
whence from (29) 
{hn} = div)v(N,W)} (79) 


where d is independent of n. The results (78) and (79) were stated in 
Section I after eq. (6). 

More generally, for k = 1,2,...,N — 1 we have that d{u*)y,(N,W)} 
is the bandlimited peauence most concentrated in (No,No + N — 1) that 
is orthogonal to {uf NAN, W)},t =0,1,...,& — 1. The fraction of its en- 
ergy in the range (No,No + N — 1) is Np (N, W). 
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3.1.2 Indexlimited sequence with most concentrated spectrum 


If {h,,} is indexlimited, so that 


0, n<No 
hn = hn, Noxn<Not+tN-1 
0, n>NotN-1, 


and if H(f) = {h,}, then 


mee NotN-1sin 2xW(n —m), ~— 


Ww 
§_,, HHOLas 
—-W = 


= ra n=No  m=No a(n — m) ee 
ery NotN-1 
f_, HORaf y |Aal? 
—1/2 No 
N-1N-1gin 27W(n — m) - 
EN plneeNG 


= n=0 m=0 a(n —m) - (80) 


N-1 
2 [hn+Nol” 


Here we have used (2) to replace H(f) in the numerator, and have used 
(4) to rewrite the denominator. The quantity yu is a natural measure of 
the extent to which H(f) is concentrated in the frequency interval 
(—W,W). Comparison of the right member of (80) with (18) shows that 
y will be a maximum when hn+Ny = cv.(N,W), n= 0,1,...,N — 1. Thus 
the indexlimited sequence with most concentrated spectrum in ~W < 
f<Wis 


0, n < No 
{hn} = vO) no(N, W), Noxn<Not+N-1 (81) 
0, n>NotN-1. 


The concentration of its spectrum H(f) is u = Ao(N,W) and 
H(f) = dUd(N,W;f)ei™2NotN-Di Wf (82) 


with d independent of f. 

More generally, for k = 1,2,..., N — 1, IN@*% "vu \(N,W)} is the 
indexlimited sequence with most concentrated spectrum in -W <f < 
W that is orthogonal to 


INO+N-ty V(N,W)} i= 0,1,...,k—1. 
The fraction of its spectral energy in |f| < W is \;(N,W). 
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3.1.3 Simultaneously achievable concentrations 


Let {h,,} <> H(f) and consider the two measures of concentration 


Not+tN- Ww 
She)? f, H@Pas 
qe ae FY ae eae 
2 2 
> Ira {HHO df 


What values of a and 6 are possible? 

Just as in Ref. 2, pp. 74—77, one finds the attainable nonnegative values 
of a and @ are given by the intersection of the unit square 0 < a <1, 
0 <8 <1 and the elliptical region 


a? — 2aBV \o(W,N) + B2 <1 — Ao(W,N). 


The elliptical boundary cuts the square at a = 1,8 = VdAg(W,N) and a 
= V\o(W,N), 8 = 1. As either N gets large, or as W — Yo, Ao(W,N) > 
1 as seen by (58) and (67), and the attainable region becomes the unit 
square. 


3.1.4 Minimum energy bandlimited extension of a finite sequence 


Let numbers ho, hi, ..., hn—-1 be given. There are infinitely many 
ways that one can choose numbers hy, hy+i,... and hy, h—»9,.. . so that 
the infinite sequence {h,} is bandlimited. Which of these sequences has 
least energy? 

The answer is 


N-1 ; 

hin = > ajvy)(N,W) (83) 
j=0 

n=0,+1,+2,... 
where 

N-1 

aj = 5 hyv(N,W) (84) 
n=0 


J =O lsu =A. 


The energy of this bandlimited sequence is 


py ies ye. (85) 
iF d,(N,W) 


The dual to this problem is the following: Let H(f) be given for |f| < 
W. Consider extensions of H to the interval |f| < 14 that correspond to 
sequences {h,} <> H(f) that are indexlimited to the index set (No, 
No + N — 1). Which such extension has least energy? 

The situation is quite different here from the dual just discussed. Given 
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an arbitrary H(f), |f| < W, in general there is no way to extend it so that 
the corresponding sequence will be indexlimited. The extension can be 
accomplished only if for |f| < % 


Not+tN-1 ; 
H(f) = oa hne Pan 
No 

or stated another way, only if for |f| < W we have 

N-1 

H(f) = e?#i\Not N12 ag U,(NWif). (86) 

j= 

Then, of course, 
Ww 
aj = ~ f H(f)e~27!Wot(N-D/2U;(N, W;f)df 
j J-w | 


by (11). But (86) for |f] < 14 is then the extension sought of minimum 
energy. Its energy is 


1/2 N-1 
f |H(f)|2df = > a}. (87) 
—1/2 J=0 


The distinction between the two cases just treated arises, of course, 
because the Hilbert space of indexlimited sequences is finite dimensional 
while the space of bandlimited sequences is of infinite dimension. 


3.1.5 Trigonometric polynomial with greatest fractional energy in an 
interval—optimal windows 


Let g(f) be a function of the form 


N-1 
BG) = Do gpe te, (88) 
k=0 
If N = 2M + 1 is odd, this can be written 
M ; 
a= DY Brent (89) 
n=—-M 
and if N = 2M is even it can be written 
M A . 
gip= 3d . Baeitern (90) 
n=—(M-1) 


where g, and %,, are suitably defined. In either case g(f) can be called 
a trigonometric polynomial. 
For functions of form (88) one readily computes 


Ww N-1 
f lg(f)|2df > P(N, W) mn&mEn 
—-W = n,m=0 


(91) 
1/2 N-1 
f_,, lear > én? 
—1/2 0 
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with p(N,M) given by (21). Comparison of (91) with (18) and (26) shows 
that Uo(N,W;/f) is the trigonometric polynomial of form (88) having the 
largest fractional concentration of energy in (—W,W). 

Applications of this fact have been made to digital filtering,12-1618 to 
spectral estimation,!” and to the definition of an essentially band-limited 
process by Balakrishnan? in 1965. In most of these applications, N is 
odd, and the g, of (89) are required to be real and even in k. Thus for 
functions of form 


M 
g(f) = 2a, + 3 a; cos 2ajf 
1 


with the a’s real, one desires to choose the a’s to maximize the fraction 
of the energy of g in (-W,W). The answer is 


aj =v\?_ ;(2M+1,W) j=0,1,...,M 
and 
g(f) = cU,(2M + 1,W;f). 


This basic property of U, can clearly make it of special interest in many 
fields. 


3.2 A prediction problem 


N successive samples spaced Ty seconds apart are taken from a sta- 
tionary white noise X(t) of bandwidth Wo and mean zero. The linear 
predictor formed from these samples that has minimum mean-squared 
error is used to estimate the next sample value of X(t). What is the 
mean-squared error of this prediction, and how fast does it decrease with 
N? 

We write X; = X(jTo). Let the observed samples of X(t) be Xo, Xi, 
...,Xyn-1. Then the predicted value X of Xv is to be of the form 


- N-1 
X= ¥ ajX; 
0 


where the a’s are chosen to minimize n = E(X — Xy)2. The solution to 
this problem is well known. (See Ref. 11, pp. 302-305, for example.) The 
least value possible for 7 is 


AN+1 





no = min 7 = (92) 


a’s N 
where A¢ is the £ X @ determinant whose entry in row? and column J is 
EX;Xj,1,j = 0,1,..., @— 1. For the white noise case at hand this entry 
is 
2 sin QrWoT oli = J) = o? 


: — (NN, WoT 0); 
2rWoT oli — j) oWoTo” oP oii 


1394 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


in the notation of (21). Here o? = EX(t)? is the noise power. Since the 
determinant of a matrix is the product of its eigenvalues, it follows 
that 

N 

I] An(N + 1,WoT) 
0 


gz 


~ 2WoTo 





n0 (93) 


N-1 
IT An (N, WoT) 


We can now use our knowledge (Section 2.5) of the asymptotics of the 
Ar (N, W) to find the behavior of no for large N. It is shown in Appendix 
A that for 


1 
Os: Wolo < 52 


lim a log no = log (sin tWoT)?. (94) 
No lN 
Thus the mean-squared error of the best linear prediction vanishes ex- 
ponentially in N when the sampling rate 1/T9 is greater than the Nyquist 
rate 2Wo. The exponent decreases in absolute value towards the limit 
zero as the sampling rate is decreased to the Nyquist rate. 

The situation is very different when WoT'y > 4%. Then no approaches 
a limiting positive value, n., as N gets large. We find (see Appendix A) 
that 





2 2WoTo- 
a2 (1+=) oe (95) 


WoT n 


n n+1 
—<WoTo< y WH D2 ee. 
2 en 


lim 79 = 70 = 
No n 





A plot of 7. for 4% < WoT'o S *% is shown in Fig. 8. Examination of (95) 
shows that 7. = o? for WoT = n/2,n = 1,2,...., and that the loops be- 
tween these values shown in Fig. 8 get smaller and smaller as Wo7'p in- 
creases. Thus, while 7.. is zero for all sampling rates greater than the 
Nyquist rate, 7.. > 0.94 for rates less than 1/2Wo. 

The foregoing is, of course, an unrealistic model of a physical predic- 
tion scheme in that it assumes perfect knowledge of the samples. If one 
assumes that to each sample X (JT) an independent observation noise 
Y; is added, then the linear predictor takes the form 


. N-1 

X= x aj(X; + Yj): (96) 
If we assume that X is a prediction of the noisy next measurement 
X(NT) + Yn, then all proceeds as before with the matrix p replaced by 
p + (2WoT ou/o?) I where » = EY? and J is the,unit matrix. By replacing 
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1.0 


0.94 
0.5 1.0 1.5 2.0 2.5 


WoTo 


Fig. 8—Best mean squared error y~ vs. WoT'o. The noise variance o? = 1. 


Ar (N,WoTo) by 2WoTou/a2 + Ax (N, WoT’) one finds readily that 
z o2s 1 2WoTo-n 
~ 2QWoT ( ) 





Noo 
s 


s=52WoTo +n (97) 
oO 


ef 
Pel aera n=0,1,2,... 


When n = 0, so that sampling takes place faster than the Nyquist rate, 
No is positive. Indeed, 7. rises monotonically from the value pu at T'5 = 
0 to the value » + o? when WoT = ‘4, as might be expected; perfect 
prediction is no longer possible. 

A more satisfying model would add independent noise to the observed 
samples, but require X to be a best linear predictor of X(NT) itself, 
rather than of X (NT) plus noise. The asymptotic behavior of 7 in this 
case seems more difficult to obtain. A related problem is readily solved, 
however. 

Let X as given by (96) now be a minimum variance estimate of Xy-1, 
where as before the Y; are independent identically distributed random 
variables that represent the imprecision of the measurement process. 
We are now not trying to predict Xy but rather to eliminate the noise 
and estimate Xj-1 correctly. One then finds for the mean-squared 
error 


N-2 
it [d + Ax (N — 2,WoTo)] 


n =nl1—-¢ eee (98) 
nus [o + An (N — 1, WoT o)] 


where 


2WoTo 
= 9 L. 





ob 


o 
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Again using the techniques of Appendix A, we find in this case that 


g 1 is 
= = + 
‘ ft cen aN, e*) 


n+1 











5 < WoT s n= 0,1,2,... 


Equation (99) can be obtained as a special case of a filtering problem 
solved by Viterbi.2° He uses the result of Szeg6 that 





fi EO ap f log HYf)af (100) 

N--o QN —1/2 
where Qy is the determinant of the N X N Toeplitz matrix having h;_; 
as the entry in the ith row and jth column. Here, as usual, {h,,} <> H(f) 
and we require that h_,, = hn, so that H(f) is real. Szeg6’s result can in- 
deed be applied to the ratio of determinants in (92). The Poisson sum- 
mation formula, Ref. 14, p. 466, gives 


HO~ aaa e* Cw ) 


for this case where x(f) = 1 if |f| < 1 and zero otherwise. Carrying out 
the details one finds (95) again, but finds only that 





lim no = 0 
N->o 


when WoT’ < 4. Our detailed knowledge of the \’s has permitted cal- 
culation of the rate at which no approaches zero as expressed in eq. 
(94).* 


3.3 The approximate dimension of signal space 


The DPss’s {v*)! k = 0,1,..., N — 1 are bandlimited to (—W, W) (see 
(33)). The concentration of {v{"} is given by 
E(0,N — 1) 
E(-©,0) ° 
[see (22)]. From the results of Section 2.5 we have seen that as N > , 
An > Lif k = 2WN(1 — «), while if k = 2WN(1 + ¢), A, — 0. And this is 
true for any « satisfying 1 > « > 0. Thus a fraction arbitrarily close to 2W 
of the bandlimited DPSS’s are confined almost entirely to the index set 


AR(N,W) = R= O1.25N 1 


* Note: After the work in Section 3.2 was completed, it was called to my attention that 
Widom?”? has derived an important extension of Szegé’s theorem which applies to the case 
at hand here and gives the stronger result yo ~ k[sin tWoTo|2% with k given explicitly to 
replace (94). The derivations of (94) and (95) given in the present paper are felt to be of 
interest in their own right and serve to verify the accuracy of the results given in Section 
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0<n <N -—1. The remaining DPSS’s have almost none of their energy 
in this index set. 

The facts just noted can be summarized loosely in the statement that 
“for large N the set of sequences of bandwidth W that are confined to 
an index set of length about N has dimension approximately 2WN.” This 
basic intuitive notion can be made precise in a number of ways. We prefer 
the following method which treats bandlimiting and indexlimiting 
symmetrically. 

Denote by J the index set J = {0,1,... , N — 1}. Now let « > 0 be given. 
Denote by G, the set of finite-energy sequences {h,,} <> H(f) for which 


E7= X |hn|? Se (101) 
né¢I 
and 
Ew= |H(f)|2df < «. (102) 
1/22 |f|>W 


If « is small, members of G, have little energy outside the index set (0, 
N — 1) or outside the frequency range (-W,W). Now let M = 
M(N,W,«’) be the smallest integer such that there exist fixed sequences 
{gD} fo} fo) such that for every {g} © G,a’s can be found for 
which 
N-1 M 12 
= | en = > at? | oe. (103) 
n= 1 
In words, M is the dimension of the smallest linear space of sequences 
that approximates G, on the index set (0,N — 1) with “energy” error less 
than ¢’. 
With these definitions out of the way, the key theorem on the di- 
mension of signal space can be stated as follows. 
Theorem: If ¥.2 W>Oand ¢ >«>0, then 


. MI(N,W,6€’) 
lim —————— = 


2W. 104 
Py N (104) 


Proof of this theorem follows very closely that given in the Appendix 
of Ref. 7 and will be omitted here. 

For applications of this theorem it is important to note that the DPSS’s 
{vu}, fork =0,1,...,2NW(1 — n) for suitable choice of 7, can be used 
as an orthogonal basis for the M-dimensional space of sequences that 
best approximates G, in the sense of (103). Thus if N is large and one is 
dealing with sequences known to be approximately of bandwidth W and 
very small outside the index set (0,N — 1),2WN numbers suffice to de- 
scribe the sequence—namely, the first 2WN coefficients a; in the ex- 
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pansion of the sequence {h,,} on the appropriate DPSS’s. We have then 
2WN-1 
hn > ajv{?(N,W) (105) 
0 
and 
-5 h,v?(N,W). (106) 


Of course when N is large and W < 4,2WN <« N so that the savings can 
be considerable. 

Possible applications of the foregoing ideas to picture processing, 
cryptography, bandwidth compression and other sampled data systems 
should be evident. In many such applications, one starts with a signal 
x(t) € L2(—,) defined for all time. A sequence {h,,} is derived from 
x(t) by sampling at rate 1/T9 so that 


hn =x(nTo), n=0,1,.... (107) 
If X (f) is the spectrum of x(t), so that 


x(e)= f X(feraf, (108) 
then 


Hf) = ¥ hpe2tinf = > e2eint (7 X(fe2xin Tol ay 


i f-n 
“TEX (G) 10 
by the Poisson summation formula (Ref. 14, p. 466). If now X (f) vanishes 
for |f| > Wo and if Ty < 1/2 Wo, then H(f) = 0 for W’ < |f| < % where 
W’ = WoT < 14. Thus when signals are sampled at rates greater than 
the Nyquist rate, the DPSS are of particular value in providing a succinct 
method of describing N-vectors of samples. 

An interesting application of these ideas forms part of a digital 
transmission scheme invented by Wyner to be described in a forthcoming 
paper by him. 


IV. DERIVATIONS 


4.1 Basic facts of Section 2. 1-—2.3 


An orderly development of this subject is facilitated by a few com- 
ments about the operators 


jue A es sin Na(f — f’) 
a Sia sin r(f — f’) oe 
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and 


= ap os Qf — A) 5+ (NP 1) cos 2af (111) 
that appear in (10) and (16). As before, we take 0 < W < 4, but new allow 
N to be an arbitrary real number. Operators of the type (110) and (111) 
have been well studied in the past and we borrow freely from the liter- 
ature. 

The kernel 
sin Nx(f — f!) 
sin r(f — f’) 

is real, symmetric and square-integrable over the region |f| < W, |f’| 
< W. The characteristic equation, Ly = dy, therefore has as solutions 
a set of real eigenfunctions Yo, ¥1, Yo, . . . that are orthogonal on |f| < W 
and complete in L2(—W,W). The corresponding eigenvalues are real, 
and those eigenvalues that are different from zero have a finite degen- 
eracy. The eigenfunctions and eigenvalues are continuous functions of 
the parameter N. The kernel of the operator L in (110) is defined for all 
values of f. The domain of definition of eigenfunctions of L belonging 
to non-zero eigenvalues can then be extended to the whole line —~ < f 
< » by means of 


KGS] = (112) 


i 
y= > Ly. 


These eigenfunctions are readily seen to possess continuous derivatives 
of all orders. The eigenfunctions belonging to the eigenvalue zero can 
also be chosen to have derivatives of all orders. 

The characteristic equation for M, 


MU = 0U, (113) 


is an example of the well studied Sturm-Liouville equation (Ref. 14, p. 
719). Let us denote by @/ the class of function continuous on |f| < W and 
piecewise twice differentiable there. Then (1138) has solutions in @ only 
for a discrete set of real values of 0, the eigenvalues of M, say 9) = 0; 2 
6. =...and acorresponding set of real eigenfunctions Uo, Uj,... can 
be found that are orthonormal, i.e. that satisfy 


Ww 
f. 4, UMN Pdf = by. (114) 


Furthermore the U;’s are complete in £2(—W,W). 

For our particular M, (111), all the eigenvalues are non-degenerate. 
For, suppose U; and U; are linearly independent continuous solutions 
of (113) belonging to the same eigenvalue 0. From MU; = 0U; and MU; 
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= 60U; we obtain U;MU; — U;MU; = 0 or 





dU; 

i a of ay ae 2af — A) af 
a py, ui _ py, du] 
= 4; (00s 2af - ay{u ia U; ma 0. 


Integrate this last equation from f = —W toa generic point f’ with -W 
<f’ < W. We find that 


dU;(f’) dU; (f’ 
U;(f") ae mf?) a ) 0, -W<f’<W, 
which contradicts the assumed linear independence of U; and Uj. 
The non-degeneracy of all eigenvalues permits us to write 


09 > 0, > 02>... (115) 


It follows then from well-known theorems that U;, (f) has exactly k zeros 
in the open interval |f| < W. That an eigenfunction U cannot vanish at 
either f = W or f = —W follows directly from the differential equation. 
For if U vanishes at f = W, for instance, 

d7U 1. dU 
(cos 2xf — A) —— df —5_ sin amt ae 


fl 
+ pve 1) cos 2af — | U=0 (116) 





= )? 


evaluated at f = W shows that dU(W)/df = 0. Differentiate (116) and 
evaluate at f = W to see that d2U(W)/df? = 0. Continued differentiation 
shows that all derivatives of U vanish at f = W. But U possesses a Taylor 
series about f = W and so the assumption that U(f) = 0 leads to the 
conclusion U = 0 which cannot be. Thus U(W) + 0. 

We now know that both L and M possess orthonormal sets of eigen- 
functions belonging to U@/ that separately span £2(—W,W). We show in 
Appendix C that L and M commute, i.e. for all g(f) € U, LMg = MLeg. 
It is not hard to see then!® that one can find a single set of orthonormal 
functions in U complete in £2(—W, W) that are simultaneously eigen- 
functions of L and M. Because of (115), however, the normalized ei- 
genfunctions U;,, k = 0,1,...of M are unique except for sign. Thus the 
normalized solutions of (16) in 2, ordered by (115), are a complete set 
of eigenfunctions of L as well. 

Any continuous solution to (113) in |f] < % can be written as a Fourier 
series 


U(f) = ef N-DF 3 c,e2zinl, 
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Substituting this form in (113) we find the 3-term recurrence for the 
c’s 


1 N-1 é 
5 UN — nena + E oma. -6 Ch 
+ > (n + 1)(N sh 1)en+1 = 0, (117) 


n=0,+1,... 


Note that the coefficient of c,—-1 here vanishes if n = 0 orn = N, while 
the coefficient of c,4+1 vanishes forn = —landn =WN — 1. Thusif Nis 
a positive integer, which is the case of primary importance to us, the 
infinite system of equation (117) uncouples and we see that a solution 


is possible with 0 = c_;} =c-9=...=¢Cn = CN+1 = CN+2Q =... provided 
that 
-1 
~ a(N,W)ijc; = Oc; (118) 
j=0 


L=O,L<0.y V1 


where the real symmetric matrix o(N, W) is given by (14). Such a matrix 
has N real eigenvalues, which we now see to be eigenvalues of M as well. . 
We denote them by 6;,, 6;.,..., 0iy. From (115) we know that 11, i9,..., 
iy are N distinct non-negative integers. We denote the real eigenvector 
of o(N,W) corresponding to 6;; by 


yi) = (vf?(N,W),v{(N,W), ..., of1(N,W))? (119) 
J =1,2,...,N 


and suppose these vectors normalized so that 


Phare GR : 
2 VD )(N, W)v$”)(N,W) = jks j,k a 1,2, ee: N. (120) 


We denote the corresponding eigenfunction of M by 
N-1_ 
Ui(N,Wf) = XO vf (N,Wheit(N-1-2n)f, 5 = 1,2,...,N. (121) 
n=0 


Now again let N be a positive integer and denote by vy the finite- 
dimensional space of functions of form 


N-1 
BUR) ene (122) 


n=0 


where go, £1, ..- ,8N—1 are arbitrary complex numbers. We have just seen 
that if N is a positive integer, M leaves ¢y invariant. Indeed, a simple 
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calculation shows that if g is given by (122), then 


N-1 
Ms=e()= eee ro! (123) 
n=0 
where 
; ee 
&m = x oN, W) mn8n- (124) 


With N a positive integer, L also leaves ¢y invariant. (Indeed, in this 
case L projects all of £2 onto Sy.) If g is given by (122), one readily finds 
that 


N-1 
Lee) =) ee el (125) 
n=0 
where 
* N-1 
§m= Ds P(N,W) mn8n (126) 
n=0 


and the N X N symmetric matrix p(N, W) is given by (21). This is most 
easily seen from the fact that for integer N the kernel (112) is degenerate. 
Specifically, 
SEA! Bee A] = S eit(N-1—-2n) fp —ia(N-1-2n)f" (127) 
sina(f—f’)  n=0 


Since L and M commute, so do the matrices p(N,W) and o(N,W). 

We now show that for integer N the eigenfunctions of M spanning Gy, 
namely U;,(N,W;f), J = 1,2,...,N, belong to the N largest eigenvalues 
of M, namely, 90, 01,... , 9n-1. We order the integers 1; so that 0;, > 9:5 
>...> 96;y, so that our task is to show that i; =j — 1,7 = 1,2,..., N. Now, 
if 0’ and 6” are two eigenvalues of M with 0” < 6’, the eigenfunction be- 
longing to 6” must have at least one more zero in |f| < W than the ei- 
genfunction belonging to #’ (see Ref. 14, p. 721). It follows then that U;,, 
must have at least N — 1 zeros in |f| < W, since the smallest number of 
zeros U;, could have in |f| < W is zero. But Uj, cannot possibly have 
more than N — 1 zeros in this interval, since, from (121), we can write 


Uiy = eit N-DS bos pingn 2 = e-2nif 
n=0 
which shows U;,, to be a function of modulus unity times a polynomial 
of degree at most N — 1. It follows then that U;,, has exactly N — 1 zeros 
in |f| < W, whence Uj, has precisely j — 1 such zeros j = 1,2,..., N. It 
then follows that 0;, = 9 the largest eigenvalue of M, 0;, = 91, the next 
largest eigenvalue, ... 0; = 0n—1. Q.E.D. 
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We have now shown that when N is an integer, the eigenfunctions of 
M that span 9y are 


Nets, 
Ui(N, Wf) = X vP(N,W)eitW-1-2n)f (128) 
n=0 
where the v™) are normalized eigenvectors of o(N,W): 
-1 : ; 
X o(N,W)nmvi(N, W) = 6:(N,W)v?(N,W) (129) 
m=0 


in=0,1,...,N—1. 


These U’s are also eigenfunctions of L and from (126) it then follows 
that 


N-1 sin 27W(n — m) 
m=0 a(n — m) 
in=0,1,...,N—1. 


v®(N,W) = \;(N,W)v?(N,W) (180) 


The matrix p(N,W) is positive definite, since 


N-1 = WwW : a" 
S p(N,W)nménEm =X f. (deem E En 


n,m=0 
WwW 
i 


which is positive unless all the £’s are zero. Thus the A; (N, W) in (130) 
are all positive. 

We have defined the U; as eigenfunctions of M and have ordered them 
so that (115) is true. These same U; are a complete set of eigenfunctions 
of L and we define ); to be the eigenvalue of L corresponding to U;. We 
shall show next that the non-zero eigenvalues of L are non-degenerate 
and that when N is a positive integer 


Ao(N, W) > AVN, W) >... > Aw-1(N,W) > 0. (131) 


2 


N-1 . 
yy £,e mint dt 
0 








The proof that if \ ~ 0 then A is non-degenerate can be made exactly 
as in Ref. 1, equations (380)—(39). The assumption that two independent 
eigenfunctions of L, say U, and U,,, belong to the same eigenvalue \ # 
0 leads to the conclusion that 6,, = 6, which we have shown to be false. 
The reader can find details of the proof in Ref. 1. 

We note next that for integer N, U,(N,W;fW) — cxPz(f), |f| < 1, as 
W — 0 where P; (f) is the Legendre polynomial of degree k. This follows 
directly from the differential equation (16) which for small W be- 
comes 

me -f?) a xUn(N,WsfW) + 0(W2) = 0 
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where x = 14(N?2 — 1) — 20. Thus 0, (N, W) = 44(N2—-1) — Yk(R +1) 4+ 
0(W), k =0,1,...,.N — 1. Now the argument of Ref. 1, pages 61-62 holds 
again and it follows that for sufficiently small positive W, (131) holds. 
Since for integer N and 0 < W < 4 these \’s are non-degenerate and are 
continuous in W, it follows that (131) holds for 0 < W < % which is stated 
as (8). 

Proofs of the remaining claims of Sections 2.1—2.3 are all of a more 
elementary nature. Most involve a straightforward calculation. We leave 
the details of the verification of these claims to the reader. 


4.2 Asymptotics of the differential equation 
We now consider solutions of the differential equation 
d dU 1 
4 leos w 4) + | F(N2- 1) cos o—0| U =0 (132) 
dw dw 4 
for 0 < w S a when N is large and 


40 = BN2+ CN + > DjN-i (133) 
'=0 


J 


where B, C and the D’s are assumed independent of N. The substitu- 
tion 


G 
piss 134 
Vcosw—A ee 

gives 

d?G_ . N2(cos w — yo)(cos w — y1) 

Ga 1 

dw? 4(cos w — A)? ve 
or 





cot |N cosw—B 
dw? 


Here 


C 
4(cos w — A) = 4(cos w — A) = 0) | als ONO) 


1 1 
=B+0(-), =A+0(=:). 1 
yo N y1 N (137) 


Case A.1>B>A>-—lork =|2WN(- 9] 


If1>B> A> -1, then, as seen from (135) and (137), U is oscillatory 
for 1 = cosw 2 B and for A = cos w = —1, but is non-oscillatory in the 
interval B = cos w 2 A. We investigate the solutions of (132) separately 
in each of these regions and also in the vicinity of the turning points yo 
and yj. 
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Let cos w — A = t/N2. Then (132) becomes 


d?U dU B-A 1 
arcane Ja =0 
dt? ae ae dt 4(1—-— A?) N 


so that near cos w = A we have 


U,=d4lpo (N ayy 080 4)), cosw2A 
Uw 
Us = dado (N aaa — cos w)), cosw <A. 


(138) 


Here Jo and J are the usual Bessel functions. We note that when cos w 
=A+u/N 


Us = dalo (nv V 2-4.) 


(1 =A?) 
~—24_| yi4/(Ba=A) |e 124 / {B= A) 
Vege Vaan] eo (MOV as") 
(139) 


(see Ref. 9, Vol. II, eq. 7.13.5, p. 86). When cos w = A — u/N 


(B — A) 
Sai (NV? Ae ) 
Us = dado (1 —A2) u 
2 (B — A) uf | (B — A) “| 
aed N12 Nie Ay ee 
Vi | Veo — A?) os Vas 4 
(140) 
(see Ref. 9, Vol. II, eq. 7.13.3, p. 85). 
Now, the WKB solution of 
2 
oe [n2B2(x) + nF(x) + O(1)]g = 0 (141) 


dx 


for large n is 
1 
g(x) ~ FE [cye~nS Edx—1/2S(F/E)dx 4 egenS Edx+1/2S (F/E)dx] 
E 


(142) 


provided x is not a zero of E(x). (See Ref. 8, Sec. 7, Lemma 2.) Applying 
this to (136) and taking account of (134), we find that for B > cosw>A 
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an asymptotic solution of (132) is 


N w B-cost 
= ————— d 
a= aatt @) oe (= 2 arccos B Veen t-A : 
1 w Cdt 


Ta) neceae BAT (B — cos t)(cos t — a) oe 


R(w) = |(B — cos w)(A cos w)|7"/4. 


Here we have chosen ce = 0 in (142) to obtain a matching of U3 and U4 
at cos w = A + u/N. Indeed, one finds in a straighforward way that 


- d3N1/4 
Us (arccos (A + n)) ~ TAB — ayia — Ay 





X exp ( Fis + N12 re u— (Cha) (144) 
where 
d 
Ls= f. Vee ne ae 
dé 


L . 145 
aa Wess eer aes 


Comparison of (144) and (139) shows that 
dg = V2n NV2(1 — A2)-M4e -(NL3/2)-(CLa/4) 5, (146) 


An asymptotic solution to (132) near the turning point cos w = yo is 
obtained by substituting cos w — B = t/N?/3 to obtain 
d?U - t 
dt?) 4(1—B*)(B-A) 


Thus, near cos w = B, we find 


U + 0(N-2/3) = 0. (147) 


/ = 
N2/3(cos w — B) ) (148) 


[41 — B2)(B - Ay} 
(see Ref. 10, 10.4.1, p. 446). Here we have chosen the asymptotic solution 


of (147) that agrees with Us at cos w = B — u/WN. Indeed, from Ref. 
10, 10.4.59, page 448, we have that 


U ~ Us = doAi G 





u . N'46y 
Ue (arceos (B 7 vn) = del aa =p (B= DE) 
do Ni6y —1/4 
a — B*)(B- au 
N1/4y,3/2 
xe (-Saa-ane-ape) 
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On the other hand, from (143) we find that 


U3 (arccos (B = a) ~TB- AA cs 77 


N149 y3/2 


ENP G 2 3[0— B2)(B— A)]!2 


) (150) 


so that on comparison with (149) we must have 


9-5/6 
de = —— N-V] — B2)1/12(B — A)1/3d,. 151 
oe ( eel do (151) 
On the other side of this turning point the solution U2 continues as 
u —N1/6y 
be) dan (ta ed 
ee (arecos ( VN 2 \T4 — B2)(B — A) 
do | NiV6y ie : ; N/4y3/2 ie 
(40 —-B)(B-A}2) >" 13Vaad— BBA) “| 


(152) 


as seen from Ref. 10, 10.4.60, page 448. 
Applying (141)-(142) to (136) for 1 = cos w > B, we find that 


cosw—B 
E = , eee ee 
: Vs w— A) 
On recalling (134), we find the asymptotic formula 


U~UL= d:R(w) cos | © f° jiak, 


cost -—A 
at oak 
4 Jo vV (cost — B)(cost — A) 


with R(w) as in (143). Near the turning point cos w = yo, this becomes 


a d,N1/8 
U, (arecos (Bt) ~ TI/A(B — Ayia Ay 


X cos ez - 
2 


| (153) 


2/,N 1/44, 3/2 C 
V4(B—A)1—B2) 4 gee) Mee 


where 
d 
i= f v4 ae 
dé 
L : 155 
a Soa eo 
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Comparison of (154) and (152) shows that we must have 


dg = Vx N*/6[4(1 — B2)]-1/12(B — A)-1/3d, (156) 
and 
N C T 
—[L,-— =— . 
cL a Lot @ 7 (mod 27) (157) 


Turning now to the interval A > cos w = —1, we find from (141)- 
(142) 
ae 
[(B — cos w)(A — cos w)]}/4 
B-—cos a 
x 
ais if JS. / Bast — cos :2 
© dt 
a8 ae a 
4 Jw V(B-cost)(A — cost) 
At cos w = A —u/N this becomes 


“)) a dgN1/4 


Us (arccos (A = N (B — A)¥4y 1/4 . 


U~ Ug= 


at |. (158) 





N = 
X cos [pis miey/ 2 a ut Slot | (159) 


1—A2 
where 
iss {Voom 
(A= a = ) 
dé 
Lg = ——— 160 
: iG (B-(A-Ha-2) oe 
Comparison with (140) shows that 
dg= 4/2.N-1020 — A2)/4q,. (161) 
Tv 
N 
3 ist “Le +0= ; (mod 2r). (162) 


Equations (138) and (148) provide asymptotic solutions to (132) at 
the two turning points cos w = B and cos w = A. Equations (153), (143) 
and (158) provide asymptotic solutions for the regions away from the 
turning points. Equations (146), (151), (156)—-(157) and (161)—(162) in- 
sure that these solutions join together. This solution is summarized in 
Eqs. (42) where the regions of validity for each piece are shown explicitly, 
and in (48)—(48). As presented there, the constant ¢ of (153) has been 
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chosen as —[1 — (—1)*]7/4 as it must to satisfy the inequalities shown 
in (9). 
To normalize the solution (42)-(44) we must compute 


W; = d? Sg?(w)dw (163) 
i=1,2,...,6 


where the range of integration for each g? is the range of validity for that 
g given in (42). We then require that 


3 W; = 7 (164) 
1 . 


since w = 2rf. 
Asymptotic forms for the W; are readily worked out. One finds, for 
example, 


arccos (B+1/N) 
W, = d? f, 


Bi(w)dw 


~~ 


ol ¢} — dé 2 
2 Ja+iivn V(E—-BYE-A)L—- £3) 2 Oil, A169) 


while, with » = [4(1 — B?)(B — A)]-14, 


B B+1/VN Ai2(—N2/3p(t_ — B)) 
Wenal[ f° aus GM" gy] AbtcNtente — BD 
2 8 Dp-avN B Poa 


_ GEC APIN ede | d? 1 AI-=NUv8)de 


VN Jo V1—B? VN Jo V1 — B?2 


By using the asymptotic forms for Ai(x) (see Ref. 10, 10.4.59, 10.4.60, 
page 448) one finally finds 


d? = d? De Wi 
——— N 7/12 = cE 
rVyV1 — B2 N44 Le N14 


where c is independent of N. In a like manner, one finds that all the ratios 
W;/W,, i = 2,3,..., 6 vanish with increasing N. We omit the details here. 
Equations (164) and (165) now give 7 ~ W, ~ ‘4d?L so that d; = 
[22/L»]1/2. Equations (146), (151), (156) and (161) now determine all the 
d’s to have the values given in (48). 

Recall now that U;(N,W;f) has k zeros in —-W < f < W. For the so- 
lution we have just constructed, all zeros in (— W, W) are contributed by 
U, of (153). From (154) we see that the number of zeros is given 
asymptotically by 


He 2 Nw4 C }|~% 





Wor 


k= 


—L, 
2 3V4(B—-A)A—B2 4 


us 
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Thus if we set k/N = 2W(1 — «) ~ L,/x, we must have L; ~ 2xW(1 — 6) 
= ak/N which is (48). 

Finally, the two phase continuity requirements (157) and (162) must 
be met. The first of these is satisfied by the choice of C given in (45). This 
number lies between zero and 8z/L» and hence is 0(1) as has been as- 
sumed throughout the development. Equation (162) is satisfied by 
choosing @ as in (46). 


Case B.1=B>A>-—lork = 0(1) 

If, in the preceding analysis, B is allowed to approach 1, the first 
turning point approaches w = 0 and the subinterval of (— W, W) in which 
U can oscillate becomes vanishingly small. This suggests a separate in- 
vestigation of (132)—(133) around w = 0 when B = 1. 

Substitute 
_ (2a)1/4 

VN 


into (132), where, as before, a = 1 — A. One finds 


d2U Cc. /2 t? 1 
<—+|-=1/=-= U+0(—)=0. 
dt2 4% q A N | 


Asymptotically, then, U is a solution of Weber’s equation D 
+ (x — %4t?)D = 0 (see Ref. 9, Vol. II, 8.2, page 116) which has bounded 
solutions only if x = k + 4% where k is a nonnegative integer. We are thus 
forced to take 


c= (x +5), = 7 N2- (+5) v/ <n +00). (166) 


The corresponding solution is generally denoted by D; (t) and has exactly 
k zeros (Ref. 9, Vol. II, 8.6, page 126). Thus we are led to take 





25) t 


Up(w) ~ cvDe(t) = eDe (S-) me) (167) 


for fixed t, or w = 0(N-!/2), as reported in (39). Examination of higher 
order terms (omitted here) shows (167) to be correct asymptotically even 
for w = 0(N-!/8), whence the range of validity shown in (38). 
Solutions of (132) near A and in the regions away from the turning 
points can be obtained in the present case from U3, U4, Us and Ug of 
Section 4.2, Case A, by letting B = 1 and C = —Va@/2(k + 14) in (138), 
(143), and (158). The indicated integrals in these last two equations can 
now be carried out explicitly. Equations (39) result. The constants c1, 
Co,..., C5 are then adjusted so that the solutions match asymptotically 
at the edges of their regions of validity. Finally, the solution is normal- 
ized. Again the oscillatory part near f = 0 dominates the asymptotic 
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behavior of f12,.U;,(N, W;f)2df and we find 


2 N73 9a \ 1/4 
Vi=cj f, fi(w)?#dw = Ga) 


2 D2 
x et| f D2(t)dt — ie real Dita | 
1/4 
pe ae 2kp —t2/2 
ot (se 4, Re Jum . ae| 


peer an En (168) 


(See Ref. 13, 7.711-1, p. 885 and Ref. 9, Vol. II, 8.4.1, page 122.) This 
determines c,; and the values shown in (40) are obtained. We omit the 
straightforward but tedious details here. 


Case C.1>B=A>-lork =|2WN + (b/z) log N] 

If, in the analysis of Section 4.2, Case A, the parameter B is allowed 
to approach the value A, the two turning points coincide at cos w = A 
and a new analysis of U in this neighborhood is now required. 

With 6 = AN2+ CN + O(1) and 





B= rae a [csc 2nW|, 
in (133) substitute 
cosw— ARID, at a (169) 
to obtain 
1. I. 
gF” + (1— )F— 5 (1 — iE 6)F +0 & =0 (170) 
where 
E= ; (A -—C). (171) 
For large N, then, F(&) ~ &(a,1;&) where 
a= ; (1 —iE£) 
and 
2 
agers = See 


c(c + 1) 2! 
is the confluent hypergeometric function. (See Ref. 9, Vol. I, 6.1.1, page 
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248, and 6.2.6, page 250.) Thus for cos w near A, we have 
U ~ ege'(8/2)N (cos »— A) b(a,1;—i BN (cos w — A)) 


as reported in (51). This expression is real. 

Solutions away from the double turning point cos w = A can be ob- 
tained from (153) and (158) by setting B = A. The integrations can be 
done explicitly. The functions h; and h3 of (51) result when C is replaced 
by E via (171). 

There now remains the task of choosing the constants so that hy, ho, 
and hs join properly. We indicate a few key steps. 

When 


u 
N23 ° 





cosw=At 


N 1 E 
hy ~u7!N13 cos E arccos A — 5 NY Bu —- = log u 


N1391/2 





+ EB log —k | - (172) 


To develop an asymptotic expression for he at this point, we avail our- 
selves of the formula (Ref. 9, Vol. 1, 6.13.1.2, p. 278) 


P(a,c;x) ~ oa (= + ae a nae (173) 


where « = 1if Imx > 0 and e = —1ifImx <0. One finds 


ho | arceos (4 + A) 
e—7EB/4 


~ (EB) V2y 


where y = 148N!/3u and where the real functions r and w are defined 
by 


cos |- y —W(EB) + ; - “EB log 27 | (174) 


Tf. : 
T 6 —_-= ia) = r(a)ei(a), 
2. 2 


Comparison of (172) and (174) shows that we must have 
ey = B12r(EB)eP4NV2e, (175) 


and 
«WN + = log N — = log ; + W(EB) —k ae ; =(0(mod 2x). (176) 
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When cos w = A — u/N2/, from (51) one finds 


N13 N 1 E 
hs mer cos EB arccos A + 5 Nu _ > log u 


N1/391/2 





+ EB log ~(k +1) S| (177) 


while from (51) and (173) 


ho | arceos (A — xin) | 


~ (EB) V2 
Comparison of these last two equations yields 
e3 = B—-1/2r(EB)—leEea/4N 1/26, (179) 


cos [7 — ¥(Ep) — = log 2y — *|. (178) 


to match the amplitudes, while matching of the cosine arguments gives 
(176) again. 

Now the number of zeros, ki, of (50) in the interval (0 S w < 
arccos (A + N-2/3)) is seen from (172) to be given asymptotically by 


mene) 
B 


while asymptotically the number, ko, of zeros of U in (arccos (A + N-2/) 
< w < 2) is obtained from (174) as 





Bea | «wn - 5 Nuss + EB log 


as 


ho~= | anv + (EB) EE es anv |. 
aL2 4..- <o 
Thus the number of zeros of U(f) in (- W,W) is given by 
2 
k = (ki + ho) ~= [ wv + 2 tog FB igo 4 WEB) - “|. 
us Z 2 2 4 


(180) 


This motivates the choice of E as a root of (53). When this is done, the 
matching condition (176) is also satisfied. 

The constant e,e2,e3 must now be determined by (175), (179) and the 
normalization requirement (9). Routine calculations show that 


P arccos [A+N~2/3] 
Xi set f [hi(w) Pde 


1 arccos [A+N-2/3] d 2 
~oet f : — Se ~ EE log N, (181) 


2 cosw—A 


1414 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


Oe ae 2 
aes Sorc {[A—N-2/3} [halo] ae 


1,7 dw e28 e7Be Eb 
2 2 fies [A—N-2/3] A — COS w 3 3 ee 


while 


ue ; arccos [A—N-2/3] [h ( )]2d 20 1 2 0(1) (182) 
2=e9 i [A+N-2/3] BM ee Go _ 


which is negligibly small compared to the first two integrals. The nor- 
malization integral thus gives 


ef [1 + e£8t] log N =z. 


The values (55) then follow where the factor (—1)l*/2! is dictated by 
(9). 

We have assumed throughout this analysis that EF = (A — C) = 0(1). 
It follows then from (53) that we must have k = 2WN + (EG/z) log N 
+ 0(1). If then, we write 


Pe | 2W +? tog 1 | 
TT 


as in (49), it is seen that for the root of (53) we have 
Ew~ b/s. (183) 


Consideration of the detailed nature of Y shows that E must be taken 
as the root of (53) of smallest absolute value. 


4.3 Asymptotics of \,.(N,W) for large N 


The values of \;,(N, W) for large N reported in Section 2.5 are obtained 
from the asymptotic expressions for U;(N,W;f) given in Section 2.4 by 
means of the basic relation 


WwW 1/2 
\x(N,W) = f ( Us(N, Wf df | f (URN WsfPaf. (184) 
Let 
V; = Sc2f?(w)dw b= 1,9) 242,45 
W; = Sd7g?(w)dw i=1,2,...,6 
X;= Serh2w)dw i =1,2,3, (185) 


where the ranges of integration are given by the corresponding intervals 
of validity shown in (38), (42) and (50). 
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For fixed k and large N, 1 — \x, ~ (V4 + V5)/zm since > V; = 7. Now 
straightforward developments yield 





arccos [A—N~-2/3] N 
Vazek ff | VA= wo50 | de 





2rW V2—-—a 
N2/3 VI—aq C2 
ae oS Al vi | Fe aa: ee GT 
NBV/t=ai Sy 7° | ama on NA 
ee ae Ge 2 
V5 = C5 i {[A-—N-2/3] flw)de 
2 A-N7-2/3 dt 
S i 2. (187) 


sere VA=pd-pd=2) woO-A) 


But, from (40), c3/(N4/3c2) = 0(N-/3) so V4 is negligible compared to 
Vs and we have 


I= M~eV 5 ~ cF/2V2(1 — A) 


which is (58). 

The formula (59) is obtained from 1 — \, ~ (Ws + We)/x using (42) 
and (44). One finds Ws ~ d3/(2rN3/2/B — A) and Wg ~ Yod2Le. 
Equations (48) now show Ws to be negligible and 1 — Ax ~ We/a ~ 
Led3/2r. Insertion of the value for dg in (48) gives (59). 

Formula (60) arises from 


1 2rWw 
=— | x + f ethadde | 
T arccos [A+N7-2/3] 


We have already commented in connection with (182) that the integral 
here is of smaller order than X, so that 


= ~X, ~ a log N = [1 + e#67]-1! (188) 


by (181) and (55). Since E ~ b/@ by (183), (60) results. 

Finally, the approximation (61)—(62) arises from (188) and (53) by 
solving the latter approximately for FE. From the theory of the I func- 
tion (Ref. 10,.6.1.27, p. 256, and 6.3.3, p. 258) one finds that Y(s) = 4(y 
+ 2 log 2)s + 0(s?). Inserting this in (53) one finds 

kn 


Nxw -—-— 
2 4 

Log SM} 
sg B 97 


This together with (188) is (61)—(62). 


Ep~— 
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4.4 Asymptotics of \,(N,W) for small W 


Consider the matrix eigenvalue problem 
N-1 ; 
~ Klex;,cxjwyy; = ui, 1 =0,1,...,N—-1 (189) 
j=0 


which has solutions only for those values of v = 1/u for which the deter- 
minant D(v) = |6;; — vK (cx;,cx;)|N—1 vanishes. Here, as in the rest of this 
section, we denote by |f(i,7)| v—1 the determinant of the N X N matrix 
whose element in the ith row and jth column is f(1,j), i,j = 0,1,..., 
N — 1. In (189) we consider the function K(- , -), the weights w; and the 
points x;,j =0,1,...,.N — 1as given. The number c is a parameter. For 
the determinant we have the development in powers of 


N 
Div) =1+ Di (-1)"d,v, 


n=1 
where 
1 N-1 N-1 
dp=— Yves YL [Klexe exe Wejl,-r 
nN. €9=0 £n-1=0 
If now 


K(x,y) = Najxty/, ao =1, (190) 
0 


the development in the appendix of Ref. 21 from equation (A4) to (AQ) 
can be repeated step by step with all integrals replaced by sums to show 
that for small c we have for the eigenvalues of (189) 


Mn = c?"x9(n)[1 + O(c)] (191) 
n=0,1,...,N-1 


where 
xo(7) = — ila lreeile (192) 
lalneat lites lySt 
Here 
N-1 
hy, = DY x}Ju;j. (193) 
i=0 


To use this result to obtain asymptotics of \,(N,W) for small W, di- 
vide (18) by 2W, and write 


A Ps 
;=j,j/=9,1,...,N—1. (194) 


c= 2nW, b= oop Xi 
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Equation (18) then becomes (189) with w; = 1 and 


_sin(x—y) _ 2 (-1)"(x —y)™ 


K(x,y) xy 7 0 (2n + 1)! 
(-1)" ) xi(—y)2-i 
~ 2 (Qn + 1)! en) 


For evaluation of (192) we thus have 


0, i+jodd 
Gig = { (—1)si+G4))/2 se regen (196) 
it@+jt+1) 
and 
N-1 
ie ee Be (197) 
2=0 
To evaluate (192) we first note that the equations 
n 
> ay Yj = Sin, 1=0,1,...,n (198) 
j=0 
yield 
Y, = laij|n—1 , (199) 


|aij|n 
the reciprocal of one factor of (192). Now, from (196) we see that we can 
also write 

-[)yitG)/21. pi. 
Presses ata a (200) 
ity! 2 J-1 


Insert this expression for a;; into (198) and define 


re 
F(t)= & (-1)s#2y;, (201) 
Equation (198) then reads 
1 ‘ 2n! 
F(t)t'dt = ———~ bin 202 
J Futiat = a (202) 
P= Ob 2c. cn. 


But F(t) is a polynomial of degree n orthogonal on (—1,1) to t?7,i = 0,..., 
n — 1. We can write therefore F(t) = kP,,(t) with P,,(t) the usual Le- 
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gendre polynomial. Now 
1 Qntl(n!)2 
PADtd =] 
f, n(t) (2n + 1)! 
(see Ref. 10, p. 786, 22.13.8-.9) so the last of equations (202) shows that 
k = (2n + 1)!/2"n!'(—1)"/2. We thus have 
(2n + 1)! (2n)! | ue, n(n — 1) 
2°n'(—-1)"/2 27(n!)2 2(2n — 1) 


—1)3n/2 
gh ed 
n! 


F(t) = kP,,(t) = tr-2 4, | 


on using an explicit form for P,,(t) (see Ref. 10, p. 775, 22.3.8) and on 
recalling the definition (201). Comparing coefficients of t” we have now 
established that 


1 ln Q2n ! 2 
Aes ae ee (203) 
Yn, |aiyjln-1 (2n + 1)!(2n)! 


It is not difficult to obtain this result by direct evaluation of |a;;|, which 
is a product of Cauchy determinants. 

We use a similar technique to evaluate the second factor in (192). The 
equations 


n 
hj4jZ; = bin, t= 0,1,...,0 (204) 
=0 


j 
yield 


lhitjln— 
Zn = 71+ | iP (205) 
[hisj | n 
Using the definition (197) of hj+;, (205) becomes 
N~-1 . . 
> x'G(x) = bin, 1=0,1,...,n (206) 
x=0 
where we have written 
G(x) = ¥ Zjxi. (207) 
j=0 
Thus we seek an nth degree polynomial G(x) satisfying (206). The 
coefficient of x” will give the desired ratio (205). 
The Tchebyshev polynomial t, (x) (see Ref. 9, vol. 2, pp. 221-223) has 
just the properties sought. It satisfies 


N-1 
>. x't,(x) =0, i =0,1,...,n—1. (208) 
x=0 


PROLATE SPHEROIDAL WAVE FUNCTIONS—V_ 1419 


An explicit formula for the polynomial is 


to(x) =1 
tn(x) =nlA"[(x)n(x —N)n], n=1,...,N—-1 (209) 
where we write (x), = x(x — 1)(x — 2)... (x —n+1) and define the 
forward difference operator A by Af(x) = f(x + 1) — f(x) and A"f(x) = 
A[A"—1f(x)]. The polynomials satisfy the recurrence 
(n + 1)tp4i1(x) — (2n + 1)(2x — N + 1)t, (x) 
+ n(N2-—n?)tn_-i1(x) =0 (210) 


n=1,2,...,N—-—2 


(Ref. 9, vol. 2, p. 223, (6)). 

From (210) by using (208) we can easily calculate 24 lx "tp (x) = Sp. 
To do so, multiply (210) by x”~! and sum. Recalling (208), one finds 
2(2n + 1)S, = n(N2 — n2)S,_1. Since So = N, we have 


Nn! Il (N2 — n?) 
— (211) 


n 


~ 9"1.3-5..,-(Qn +1). 
It follows then that 
1 
G(x) = tn(x) (212) 
Sn 
is the nth degree polynomial satisfying (206). 
We now seek the coefficient of x” in G(x). The coefficient of x” in 


ty (x) is not evident from the definition (209). However, it is easy to show 
that A(x), = n(x),—-1 and that 


An[f(x)e(x)] =O ("") Ajf(x)A"~ig(x + j). 
j=0 \J 


Applying these rules to (209) we obtain the alternative expression 


n n 2 : 
ta(x) = > ( .) (x)n—j(x — N+ j)j- (218) 
j=0 \J 
It follows then that the coefficient of x” in t,, (x) is 
2 
ECO) a 
j=0 \J n 


(see Ref. 13, p. 4, 0.157-1). From (207), (212) and (213) it follows that Z,, 
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= k/S,. From (205), (211) and (214), then one has 


TL W-/) 
j=Hon 


1 

Soe ia ee ae (215) 

h; iIn— Lin 2 
[isi |n—1 (2n + 1) ("”) 
n 


This result combined with (203), (191), (192) and (194) yields (64)— 
(66). 
Since |hi+;|o0 = N, (215) yields 
n (j!)4(N2 — j2)nt1- 
h; ‘n= Nt fa a 
Heel Mm i+ DIQP 
a formula that seems difficult to arrive at by direct manipulation of the 
determinant. 


lhisjln 7 


(216) 


APPENDIX A 
Asymptotic Behavior of no 


We here investigate the product (93) for large N. We adopt the ab- 
breviation W’ = WoT’. 
Suppose first that 0 < W’ < 14. We write (93) in the form 


~ log ny = Pi + Pa + Ps (217) 


where 


pS IN +10) 
: oe (sap 0 : ) 


N 
1 2WIN-2) dpa (N + 1,W’) 
Ps =— log ————__———- (218) 
aN i Arn (N, W’) 


1 N= Anti(N + 1,W’) 
Ps: =_ lo F 
N p=2W'N-1 Ar (N, W’) 


In this last sum set k = N — 1 — @ and use (18) to obtain 
1NQ-2W) = _y_p(N+1,W’) 
Di! NOR ee 
£=0 An-1~e(N, W’) 
yw ie 1—Ag(N + 1,W) 


where we have written W =  — W’. Now from (59), if 2 = sN, with s 
fixed and 0 <s < 2W, 1—Ag(N,W) ~ exp [—14C(B,N)L4(B) — NL3(B)| 
where C, L4 and L3 are given by (45) and (47) and B is determined as a 


Ps 


Zlr =| 


(219) 
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function of s by (48), namely 


& — B(s) _ 
~f Veo Cae (220) 


In these formulas we now have A = cos 27W. 
For large N, then, and fixed s, a term in (219) takes the value 


1-dA(N+1,W) 1 


J TNT) Tg CBN + DLAB) — (N + DL3IB) 


Ul 
oy 


: [- * C(B,N)La(B) - NLo(B) | (221) 


where 


i Gaaee (= a (&— A)(1 — £2) 


Ss 
“Net No NNED-: NSIT 





(222) 
Thus ‘ 


B=B (s _ aa) B(s) — sB’(s) + 0 (x) : 


Straightforward Taylor expansions of quantities on the right of (221) 
now give 


Fics, 2 | 22806) = 5-4, La(B)) | 450) 


Returning to (219), we have 


Pow 5 [Ls (B(<)) - - © 15(B(s))|o= aw | + o(1) 


so that 
lim P;=— a — “ Se cata ds 


No 
2 "-f- Leds 


225 i L3(B(s))ds. (223) 





--{" Ls3ds + aba\: 


1422 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


Now recalling (220) and the definition (47) of L3, we have 


Jim Poa 2 fas coer ow 
it i °S. eae erS 


= 2 log (sin rWoT' 224 
SS sae g (sin tWoTo). (224) 
Details of the evaluation of the triple integral to obtain the last line here 
are given in Appendix B. 

It is not difficult to see that both P; and Ps, approach zero as N in- 
creases. We omit the demonstration here. Our result thus far reads 





l 
lim = 7 — 9 Jog (sin rWoT), 0< WoTo <% 
No 


which is (94). 
To study no when W’ = WoT > 4, we return to (18) and consider it 
now for arbitrary values of W. Since 


sin 27 (w+) (n —m) ‘, 
2 S [= 21W(n —m) 


a(n —m) 


Te iam | (=D, 
(225) 


a(n —m) 
the eigenvalue equation 
is sin 2a (W + >) (n — m) 
>~ -— Tv) (NW + 5) 
m=0 a(n —m) 
1\  «@) 1 
= Np (NW + 5) vl (N.W + 5) 


can be rewritten on direct substitution of (225) as 


Sapam: oe) 


= | -1 ae (N.W + 5) | [neo (NW + | 


Comparison with (18) now shows that 


Ar (NW 5) =1+d,(N,W) 
pt) (N, W+ 5) = C(-1)™v)(N, W). (226) 
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Suppose now that 
+ 
Z 2 
where n isa oe integer. Then in virtue of (226) Eq. (93) becomes 
In + Anyi(N + 1,W”) 





(227) 


+ Xo(N + 1,W” 228 
no = a oi of )] a n+ d(N.W”) (228) 
Ww’ = W’ —— n 
2, 
1 
0< W” rs . (229) 
Then 
log no = Qi + Qe (230) 
where 
o2[n + Ao(N + 1,W”)] o2(n + 1) 
= log ——__——_— + ~ log —__—— 
@1 = log ai 08 ap (231) 
and 
N-1 Be ” 
n+ Nee(N +10) eis 


Q2= Oe NW”) 
When N is large only the terms for k near 2W’N contribute signifi- 
cantly to Qo, for if k = 2W” N(1 — 6), Ax approaches 1 exponentially, 
while if k = 2W”N(1 + «), Ax approaches zero exponentially. In either 
event, the summand in (232) approaches zero exponentially while the 
number of term grows linearly with N. 
Consider now 


2WN+(a/x) log N n+ dAp4i(N + 1,W”) 


H(a,N) = re) 
h=2WN-(a/x) log N n + dA, (N,W”) 


(233) 


where @ is an arbitrary positive real number. We change from the vari- 
ables N, k to new variables A and b via the transformation 





Tw 
= N — a/A 
log N 
b=(k-—2W’N)A = . +2Wer/A (234) 
and write 
An (N,W”) = g(A,6). (235) 
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Then Apgai(N + 1,W”) = g(A’,b’) where 


1 A2 
Se AE 
log (N + 1) ee 
and 
b’=[k+1-—2W’(N 4+ 1)]A’ =b+4+ (1—-2W”)A+0 (5) . 
Thus 


/ / _ —_ uf og(A,b) = 
g(A’,b’) = g(4,b) + (1 — 2W”)A Ab eg 


and the summand of (233) becomes 


n+ g(A,b) + (1- ow") oe +0 (5) 


nt+ Anti iN + 1,W”) N 
aaa enn CRTAT ICR eccem a (1) 


| 
(bn + (N,W”) n+ g(A,b) 


iA A 
= (1 —2W”)AL(A,b) + 0 ) 
where 


L(A,b) = log [n + g(A,b)]. (236) 


Now write j = k — 2W’N. Equation (233) becomes 


H(@N) = (1- 20”) = | bean a (x) | | 


j==al. 


In the limit of large N, we have 
H(a,N) ~ (1- 2W”) f “ L(0,x)dx 

= (1 — 2W) log [n + g(0,x)]|%--z (237) 
from (236). But (60) and (235) show that 


0,b) = ——— 
g (0,0) ie 


and so 
n+[1+e74]7} 
n+ [1+ e-74]-1° 


Finally, since @ can be chosen arbitrarily large, 


H(a,N) > (1 — 2W”) log 


n 1 (1-2W’”) 
H(a@,N) — (1 — 2W”) log = —log (1 + =) (238) 
n+l n 
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Thus if 


lim Q2= lim H(a,N), 
No No 
we can write 
_ o%(n +1) 1 


~ io = — 239 
me 2W’ (1 nye eee) 


n 
in virtue of (229)-(231). This result is (95). 


APPENDIX B 
Evaluation of Integral (224) 


Ss ae oe et 2 SS 
Js lim Py=-— f dB de Vir aya aes 
1 


1 dt 1 dt 
x Sf, Aes Si V(t — A)(1 — t?) 


t dé t B-é 
x |. ve=maess J, wy ra 


eee Pig cea EY 
og J, a J, i (1 — &)(1 — t?)(A — t)(A — &) mo) 
since under the substitution u? = (B — £)/(t — B) 


fan 2=. (ig) \. eee pd 

















1 Ba ° du T 
=(t-£)|- + =(t—é)—- 
| “T+ u2 Io Si al ae: 
Now change from the variables of integration £,t to a,G via 
1+A 1-A., 
f= sin @ 
v8 
1+A 1-A., 
t= a Se sin 8 


and obtain 


1 a/2 B i — si 
J=-5f ap a ae ae 
2 J-2/2 —71/2 V(x + sin a)(x + sin £) 
where we write 


3+A 
x= ,l<x<om, 
1-A 
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Changing now to variables ¢,y through the 90° rotation 1o(a + 8) = ¢ 
Yo(a — B) = W, we find that 


Jao fo 4 v foie cos ¢ sin y 
x/2 x/2 oe — sin? yy + 2x cos sin ¢ + sin? 


= 2 f ; dy sin y [log |x cos p 
—1/2 
+ sin 6 Vx2 — sin? y + 2x cos W sin ¢ + sin? 6] $274.22) 


0 
=2 f dy sin y 
—1/2 


x cosy + cosy + Vx2 — sin? Wy + 2x cos? W + cos? W 
x cos WY — cos Wy + Vx? — sin? y — 2x cos? W + cos? p 
--2 {7 Avie alau + Vb2 4+ u?] 

° b[bu + Va2— U2] 


where we have set cos YW = u and 


X log 








a2 — b?=1. (241) 
Thus 

1 
J= —2 log = — Z f, du log [au + Vb2 + u?| 


+2 f du log [bu + Va2—u2]. (242) 
Now it can be verified by direct differentiation that when (241) holds 
Sdu log [au + Vb2 + u?] 
= (ut 1) log [au + Vb? + v2] — u — log [u+ b? +a Vb2 4+ U2] 
and 
Sdu log [bu + Va? = u?] 
= (u + 1) log [bu + Va? — u?] — u — log [a2 -u + b Va? — v2]. 


It then follows readily that the last two terms of (242) both have mag- 
nitude log 2(a + b) — 1, and that equation becomes simply 


1+ 
J = —2 log 5 —= —2 log Vittns = 2 log Vita : (243) 


When A = cos 2x W = cos 24(4% — WoT) = —cos 2nWoT, (243) yields 
(224). 
The simple result (243) for the integral given by the last member of 
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(240) was first obtained by J. A. Morrison by a route involving elliptic 
functions and identities among them. 


APPENDIX C 
Commutation of L and M 


Let operators 
b 
L= { dfK(fs) 
and 


= ss + q(f) 


be given. Then 
b 
MLg = f df/M K(f) (f") 
while 


b 
LMg = f df’K (ff )Mpe(f) 


Se 


b 
< § df'g(f)MyK ff’). 
Now if 
p(a) = p(b) = 0, (244) 
we have 
b 
MLg — LMg = f df’g(f’)[M; — Mp] K (ff) 


and so, if in addition 
MK) = Mp KGS), (245) 
the operators commute. In the special case when K (f,f’) = K(|f — f’|), 
(245) becomes 
2K (lf — f’ 
pin — py) Ps pnt ore 
+ [aM — a IK (If — f/|) =0. (246) 
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Applying this to the operators of (110) and (111) we have p(f) = 
(yx2)[cos 2af — A], g(f) = %4(N2 — 1) cos 2xf and a = —b = W. It is 
seen that (244) is satisfied. To verify (246) observe that 

ee . 
p(f) — p(f’) = —~>sin a(f +f’) sin a(f — f’) 


272 


pf +p(fy =- - sin x(f + f") cos x(f—f’) 


a(f) - a(f) = ; (N2 — 1) sin a(f + f’) sin x(f —f’). 


Thus every term of (246) in the present case contains a factor 
sin 7(f + f’). This equation then is equivalent to 
d? sin Nt d sin Nt sin Nt 


sin t —~ — +2 cos t —— + (N?2 — 1) sin t — 
dt? sint dt sint ( ) sin t 











=Q (247) 


where we have written t = x(f — f’). The reader can readily verify that 
(247) holds identically in ¢. 


REFERENCES 


. D. Slepian and H. O. Pollak, ‘“‘Prolate Spheroidal Wave Functions, Fourier Analysis, 
and Uncertainty—-I,” B.S.T.J., 40, No. 1 (January 1961), pp. 43-64. 
H. J. Landau and H. O. Pollak, “Prolate Spheroidal Wave Functions, Fourier Analysis, 
and Uncertainty—II,” B.S.T.J., 40, No. 1 (January 1961), pp. 65-84. 
H. J. Landau and H. O. Pollak, “Prolate Spheroidal Wave Functions, Fourier Analysis 
and Uncertainty—III,” B.S.T.J., 41, No. 4 (July 1962), pp. 1295-1336. 
D. Slepian, “Prolate Spheroidal Wave Functions, Fourier Analysis, and Uncer- 
tainty—IV,” B.S.T.J., 43, No. 6 (November 1964), pp. 3009-3058. 
. D. Slepian, “Some Asymptotic Expansions for Prolate Spheroidal Wave Functions,” 
J. Math. and Phys., 44, No. 2 (June 1965), pp. 99-140. 
. D. Slepian and E. Sonnenblick, “Eigenvalues Associated with Prolate Spheroidal Wave 
Functions of Zero Order,” B.S.T.J., 44, No. 8 (October 1965), pp. 1745-1760. 
D. Slepian, “On Bandwidth,” Proc. IEEE, 64, No. 3 (March 1976), pp. 292-300. 
E. N. Gilbert and D. Slepian, “Doubly Orthogonal Concentrated Polynomials,” SIAM 
J. Appl. Math., 8, No. 2 (April 1977), pp. 290-319. 
. A. Erdélyi, Higher Transcendental Functions, New York: McGraw-Hill, 1953. 
. M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions, New York: 
Dover Publications, 1965. 
. Harald Cramér, Mathematical Methods of Statistics, Princeton, New Jersey: 
Princeton University Press, 1946. 
12. A. Papoulis and M.S. Bertran, “Digital Filtering and Prolate Functions,” IEEE Trans. 
Circuit Theory, CT-19, No. 6 (November 1972), pp. 674-681. 
13. I.S. Gradshteyn and I. M. Ryzhik, Tables of Integrals, Series and Products, New York: 
Academic Press, 1965. 
14. P. M. Morse and H. Feshback, “Methods of Theoretical Physics,” New York: 
McGraw-Hill, 1953. 
15. J. Von Neumann, Mathematical Foundations of Quantum Mechanics, Princeton, 
New Jersey: Princeton University Press, 1955, pp. 170-178. 
16. D. W. Tufts and J. T. Francis, “Designing Digital Low-Pass Filters—Comparison of 
Some Methods and Criteria,” IEEE Trans. Audio and Electroacoust., AU-18, No. 
4 (Dec., 1970), pp. 487-494. 
17. A. Eberhard, “An Optimum Discrete Window for the Calculation of Power Spectra,” 
IEEE Trans. Audio and Electroacoust., AU-21 (February 1973), pp. 37-438. 


SO PAP 7 Fw NY 


— 
_ 


PROLATE SPHEROIDAL WAVE FUNCTIONS—V_ 1429 


18. D. W. Tufts, “Comments on ‘FIR Digital Filter Design Techniques Using Weighted 
Chebyshev Approximation’,” Proc. IEEE, 63 (November 1975), p. 1618. 

19. A. N. Balakrishnan, “Essentially Band-Limited Stochastic Processes,” IEEE Trans. 
Infor. Theory, [T-11 No. 1 (January 1965), pp. 154-156. 

20. A. Viterbi, “On the Minimum Mean Square Error Resulting from Linear Filtering 
of Stationary Signals in White Noise,” IEEE Trans. Inform. Theory, /T-11, No. 4 
(October 1965), pp. 594-595. 

21. D. Slepian, “A Numerical Method for Determining the Eigenvalues and Eigenfunctions 
of Analytic Kernels,” SIAM J. Numerical Analysis, 5, No. 3 (September 1968), pp. 
586-600. 

22. H. Widom, “The Strong Szegé Limit Theorem for Circular Arcs,” Indiana Univ. Math. 
J., 21, No. 3 (1971), pp. 277-283. 

23. F. Gori and G. Guattari, “Degrees of Freedom of Images from Point-Like-Element 
Pupils,” J. Optical Soc. Am., 64, No. 4 (April 1974), pp. 453-458. 


1430 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


Copyright © 1978 American Telephone and Telegraph Company 
THE BELL SYSTEM TECHNICAL JOURNAL 
Vol. 57, No. 5, May—June 1978 
Printed in U.S.A. 


An Earth-Space Propagation Measurement at 
Crawford Hill Using the 12-GHz 
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This paper describes a measurement of atmospheric attenuation 
and depolarization, primarily due to rain, of the 11.7-GHz cw beacon 
signal from the Communications Technology Satellite (CTS). This 
beacon source made possible the first fixed-path, nearly continuous 
measurements of earth-space propagation at this frequency. A mea- 
surement is being made at Bell Laboratories, Crawford Hill, in Holmdel, 
New Jersey using a fully steerable, 6-meter-aperture, horn-reflector 
antenna fitted with a dual-sense, circular-polarized feed. The ampli- 
tudes of the copolarized and cross-polarized components are measured 
with a two-branch, stable, narrowband, frequency tracking receiver. 
The receiving system, which was designed to run unattended, is de- 
scribed. Propagation data for a greater than 1-year period beginning 
April 1976 are presented. The attenuation data show that an outage 
time of 21/2 hours per year can be expected for a 10-dB rain fade margin. 
Significant anomalous depolarization effects not directly related to 
rainfall have been observed. 


I. INTRODUCTION 


The launch of the CTS satellite has provided a useful signal source for 
earth-space propagation experiments. The interest is in measuring 
long-term (approximately one year) atmospheric attenuation and de- 
polarization, primarily due to rain. The satellite carries a cw beacon at 
a frequency of 11.7 GHz with a nominal output power of 200 mW. The 
satellite beacon antenna is a right-hand sense, circularly polarized, 
earth-coverage horn with an axial ratio of 1.5 dB within an enclosed beam 
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angle of 17 degrees. The satellite is in geosynchronous orbit and is located 
nominally at 116°W longitude with an inclination of less than 1 
degree. 

A disadvantage in using the CTS satellite beacon for these measure- 
ments is that the beacon is turned off during the spring and fall eclipse 
periods when the solar cell panels are shadowed by the earth. In the fall 
1976 eclipse, the beacon was turned off completely for seven weeks. 
Fortunately, few significant rain events occurred during this time. In 
the spring 1977 eclipse, the beacon was turned off for only short periods, 
approximately one hour each day over a six-week period. 

The rain attenuation data for these eclipse periods were bridged by 
statistically scaling data from a colocated 19-GHz COMSTAR A beacon 
measurement. This scaling technique is described in the appendix to this 
paper. 

The Crawford Hill receiving system is located in Holmdel, New Jersey 
at approximately 74° W longitude and 40°N latitude. For this location, 
the receiving antenna point is nominally at an azimuth angle of 234 de- 
grees with an elevation of 27 degrees. This provides approximately a 
20-km path through atmospheric rain showers. The signal parameters 
for this path are shown in Table I. The clear air carrier-to-noise ratio in 
the copolarized signal branch is 43.4 dB in a 32-Hz bandwidth. The 
cross-polarization signal level for clear air is typically 33 dB below the 
copolarized signal level. A 28.5-dB carrier-to-noise ratio in the cross- 
polarized signal branch is obtained by detection in a 0.5-Hz band- 
width. 

The Crawford Hill receiver provides a continuous measure of both the 
copolarized received signal and the cross-polarized component due to 
rain or other atmospheric effects.! The entire system is normally unat- 
tended and takes data continuously on a 24-hour basis. 


ll. ANTENNA AND TRACKING 


The receiving antenna is a fully steerable horn reflector? with a 6- 
meter aperture. Figure 1 is a photograph of the antenna. The antenna 
feed structure, receiver, and recording system are located within the 
vertex cab of the horn. The antenna has a hybrid-coupled, orthogonal 
feed to provide two isolated signal signal components to the receiver. One 
port provides the right-hand circular copolarized component and the 
second provides the left-hand cross-polarized component. The measured 
isolation of these two output ports using the satellite beacon is greater 
than 33 dB on axis for clear-air propagation conditions. 

‘The receiving antenna has a gain of 57 dB and a 3-dB beamwidth of 
approximately 0.3 degree. The narrow beamwidth necessitates continual 
antenna pointing to track the diurnal satellite motion. The antenna is 
pointed by an open loop technique? using predicted azimuth and ele- 
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Table | — Path parameters for CTS satellite 


Beacon transmitter power = 200 mW = ~-7.0 dbW 
Antenna gain toward Crawford Hill = +18.4dB 
ERP = +11.4dBW 
Path loss (38982 km) to Crawford Hill = 205.6 dB 
Clean air attenuation =0.3dB 
Net loss = 205.9 dB 
Horn reflector antenna gain = 57dB 
Received signal power = —137.5 dBW 
= -—107.5 dBm 
Ground Receiver 
Receiver Tr = 1150K, kTF = —166 dBm/Hz 
Copolarized received signal C/N density = 58.5 dB/Hz 
Copolarized signal C/N ratio in 32 Hz bandwidth (clean air) = 43.4dB 
Residual cross-polarized level below copolarized received signal = —33dB 
(clear air) 
Cross-polarized signal C/N ratio in 0.5 Hz bandwidth (clear air) = 28.5dB 


vation data supplied by NASA. The overall antenna servo pointing ac- 
curacy is estimated to be better than £0.03 degree. 


lll. RECEIVER DESIGN 


The receiver is composed of two branches. One branch measures the 
amplitude of the copolarized signal, the second measures the relatively 
weak cross-polarized signal. The copolarized signal branch is used to 





Fig. 1—Six-meter horn reflector antenna. 
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Fig. 2—CTS beacon measuring receiver. 


control frequency tracking and provide a phase reference for detection 
in the cross-polarized signal branch. Figure 2 is a block diagram of the 
CTS beacon measuring receiver. Each receiver branch uses multiple 
heterodyning to convert the 11.7-GHz input frequency from the satellite 
to a final IF frequency of 9 MHz where envelope detection is carried out. 
The 9-MHz IF signal in each branch is bandpass-filtered through a 
500-Hz wide crystal filter. The 9-MHz IF output from the copolarized 
signal branch is split, with one portion used for frequency control and 
the other envelope-detected. The frequency control component is en- 
velope-limited to provide an input to a frequency discriminator. The 
second 9-MHz IF component is envelope-detected using a phase-locked 
synchronous detector. The detector output is low-pass filtered, its log- 
arithm taken and recorded. 

The phase-locked 9-MHz reference signal for the copolarized syn- 
chronous detector is split to provide a reference signal for in-phase and 
quadrature detection of the cross-polarized branch, 9-MHz IF signal. 
The outputs from these detectors are separately low-pass filtered and 
vectorially summed. The logarithm of the summed output is taken, then 
recorded. 

A detailed block diagram of the receiver automatic frequency tracking 
control is shown in Fig. 3. The components in the control loop are an 
envelope limiter, a temperature-controlled crystal frequency discrimi- 
nator, a gated integrator, a window comparator, a gated clocked up-down 
counter, and a BCD controlled frequency synthesizer with frequency 
doubler. The slow-frequency variation of the satellite beacon signal (~5 
kHz/day) is tracked by measuring the change in the copolarized 9-MHz 
IF signal frequency and controlling the nominal 56-MHz local oscillator 
frequency to return the IF signal to exactly 9 MHz. A portion of the co- 
polarized 9-MHz IF signal is envelope-limited to provide an input to a 
quartz crystal frequency discriminator. The discriminator output is 
integrated to smooth short-term variations and applied to the window 
voltage comparator. The window voltage comparator determines if the 
discriminator input frequency has changed beyond a preset allowable 
amount. The comparator gates a clock signal that either increases or 
decreases the count in a digital register. A BCD output from the digital 
register drives the frequency control of a programmable frequency 
synthesizer. The frequency synthesizer output at about 28 MHz is fre- 
quency-doubled to 56 MHz to provide a local oscillator for the 65-MHz 
to 9-MHz frequency conversion. 

A clock frequency of 1 Hz is used to drive the digital register. With this 
clock frequency and the synthesizer frequency doubling, the receiver 
will track an input frequency rate of change of 2 Hz per second. This has 
been found adequate for the beacon frequency drift rates encoun- 
tered. 
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The use of the digital register-frequency synthesizer technique pro- 
vides an automatic-frequency control with memory.‘ If the received 
beacon signal is lost because of a deep fade, beacon turnoff, or tracking 
malfunction, the receiver remains tuned to the beacon frequency pre- 
ceding the loss and will not normally change frequency until the signal 
returns. The automatic frequency tracking control is provided with a 
frequency scan circuit to retune the receiver automatically if the beacon 
signal is lost and later appears outside the 500-Hz receiver bandwidth. 
This occurs during eclipse periods when the satellite power is shut down. 
The beacon oscillator cools and changes frequency by several kilohertz. 
The receiver frequency scan is delayed about 3 minutes on loss of signal, 
then proceeds to scan at a 100-Hz-per-second rate approximately 8 kHz 
either side of the last known beacon frequency. This scan technique has 
provided rapid reacquisition of the beacon signal during eclipse. 

The overall frequency of the receiver is controlled by a single 10-MHz 
stable crystal oscillator in conjunction with several frequency multiplier 
chains. The crystal oscillator has a short-term (1s) stability of 1 x 10!! 
and an aging rate of <1.5 X 10-7/year. Figure 4 is a block diagram of the 
oscillator and multiplier chains. The first local oscillator at 11.225 GHz 
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Fig. 4—Receiver local oscillator chain. 


is obtained from a phase-locked, varactor-tuned, Gunn diode oscillator. 
The Gunn oscillator output is heterodyned with a stable reference signal 
at 160 MHz in a harmonic mixer. The frequency difference is limited and 
compared in phase with a second reference signal at 25 MHz. An error 
signal from the phase detector is fed back to the Gunn oscillator fre- 
quency control. The second local oscillator at 540 MHz is directly ob- 
tained from frequency multiplication of the stable crystal oscillator. 

Figure 5 is a detailed block diagram of the synchronous envelope de- 
tectors in both receiver branches. A detection reference signal is provided 
by a 9-MHz, voltage-controlled crystal oscillator. This VCXO is phase- 
locked to the 9-MHz limiter output in the frequency control of the co- 
polarized branch. The 9-MHz IF output signal from the copolarized 
branch is synchronously envelope-detected and low-pass filtered. A 
bandwidth of 32 Hz is provided by an active 4-pole Butterworth low-pass 
filter. The signal envelope output from the low-pass filter is passed 
through a baseband log amplifier to the data recorders. The parameters 
of the log amplifier are set to provide a 40-dB dynamic range. 

The 9-MHz IF output signal from the cross-polarized branch is en- 
velope-detected by in-phase and quadrature detectors. The detected 
in-phase and quadrature components are low-pass filtered and vecto- 
rially summed to provide a single output proportional to the magnitude 
of the cross-polarized signal. A bandwidth of 0.5 Hz is provided by active 
4-pole Butterworth low-pass filters. The combined output is passed 
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Fig. 5—Synchronous envelope detectors. 


through a baseband log amplifier to the data recorders. The parameters 
of the log amplifier are again set to provide a 40-dB dynamic range. 

The receiver and data recorder are calibrated by providing a stable 
input signal at 11.7 GHz to the receiver input mixers. This calibration 
source is obtained by frequency multiplication of a synthesized signal 
source which is referenced to the 10-MHz stable crystal oscillator. The 
calibration source was carefully shielded to prevent stray signal leakage 
to the receiver. The calibration source output, controlled by a precision 
waveguide attenuator, is coupled into waveguide switches at the receiver 
inputs. 


IV. MEASUREMENT RESULTS 


The beacon measuring system has been operational since April 1976. 
Nearly continuous measurements have been made of the coplarized and 
cross-polarized components from the satellite beacon signal. Data are 
presented for greater than a 1-year period beginning April 1976. During 
this time, some data were lost during the fall and spring eclipse periods 
when the satellite beacon was turned off to conserve battery power. A 
small amount of data was also lost due to an antenna servo malfunction. 
The copolarized signal data were analyzed for a one-year period to obtain 
attenuation statistics. Using data for attenuation greater than 1 dB, a 
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Fig. 6—Cumulative attenuation statistics for April 26, 1976 to April 26, 1977. 


distribution curve showing the time an attenuation greater than clear 
air was exceeded versus attenuation is plotted as the solid line in Fig. 6. 
This curve shows data for the one-year period excluding eclipse outage 
time. A second curve, the broken line in Fig. 6, shows the effect of 
bridging the outage periods with data scaled from a simultaneous 19-GHz 
COMSTAR A beacon measurement. The frequency scaling, described in 
the appendix, relates measured attenuation statistics at the two 
frequencies during a common time to predict attenuation statistics at 
12 GHz when the CTS beacon signal was lost. The 12-GHz statistics could 
be predicted up to approximately 14 dB corresponding to the 40-dB 
threshold in the COMSTAR A data. Beyond 14 dB, the 12-GHz distri- 
bution curve, shown dashed, asymptotically approaches the measured 
curve since heavy rain with attendant high attenuation did not occur 
during the bridged periods. In general, for practical system design, the 
attenuation statistics are of greatest interest for attenuation less than 
14 dB. 

The bridged curve represents a reasonably good estimate of the at- 
tenuation statistics for the full year. The curve therefore could be useful 
in estimating the outage time of a satellite communications link given 
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Fig. 7—Cross polarization discrimination as a function of rain attenuation. 


an attenuation or fading margin. For example, a system with a 10-dB 
margin would have had a 2!4-hour outage time over the year period. 

The distribution curve shows a change in slope as attenuation in- 
creases. At low attenuation, the logarithmic change in outage time is very 
rapid with incremental decibel changes in attenuation. Above about 
10-dB attenuation, the slope appears to be fairly constant, indicating 
there is a decreasing rate of return in outage time with an increase in 
attenuation or fading margin. Therefore, unless a much lower outage 
time is required for a particular service, a margin of about 10 dB may be 
optimum. 

In addition to the experimental result, Fig. 6 shows several calculated 
points of an attenuation distribution obtained from Lin.° The calculation 
was based on five-minute point rain rate data for a 20-year period (from 
1953 to 1972) for Newark and Trenton, New Jersey for a path elevation 
angle of 27 degrees. These points show the average number of hours per 
year that an attenuation is exceeded for a similar path at these two 
locations. Very close agreement can be observed between the measured 
and calculated data. 

The cross-polarization data were analyzed for several rain events 
during the measurement period. The cross-polarization discrimination, 
defined as the decibel difference between the desired and undesired 
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ELEVATION ANGLE: 27° 


CROSS POLARIZATION DISCRIMINATION IN dB 





RAIN ATTENUATION IN dB 


Fig. 8—Cross polarization discrimination as a function of rain attenuation. 


polarization components, was measured as a function of rain attenuation. 
Data for two types of rain events are shown in Figs. 7 and 8. In Fig. 7, 
measured data shown as dots are for three events during August 1976 
where high rain rates were encountered. The cross-polarization dis- 
crimination was measured for copolarized signal attenuations 1 dB or 
greater than clean air. The measured data show a decrease in discrimi- 
nation with increasing path attenuation. For example, at a 10-dB at- 
tenuation the undesired cross-polarized components are approximately 
14 dB below the copolarized signal and at 20-dB attenuation the dis- 
crimination has decreased to about 8 dB. In addition to the measured 
data, five calculated curves of cross-polarization discrimination are 
shown. These curves from Chu® were calculated from differential at- 
tenuation and phase shift through oblate spheroidal raindrops at 12 GHz 
for five rain rates. A maximum path length of 20 km, an elevation angle 
of 27 degrees, and a single uniform rain-canting angle were assumed. 
These curves represent an upper bound of the theoretical cross-polar- 
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Fig. 9—Anomalous cross polarization event, 11.7 GHz CTS beacon, Holmdel, N.J., Au- 
gust 8, 1976. 


ization discrimination. A reasonably good fit is shown between Chu’s 
theory and the measured data for copolarized signal attenuation greater 
than 5 dB. The measured data generally fall below Chu’s curves by a few 
dB, which could be caused by randomness in rain-canting angle as would 
be normally by expected with wind-driven rain. 

Similar data are plotted in Fig. 8 for three rain events during April, 
May, and June 1977. The events of this period were of low rain rate 
compared to the August data and therefore the copolarized signal fading 
was small, less than 7 dB. The cross-polarization discrimination was 
measured for copolarized signal attenuation as low as 0.1 dB greater than 
clean air. The measured data show considerable scatter with little re- 
semblance to Chu’s curves. This scatter might be due to ice or atmo- 
spheric turbulence effects that are hidden in the more intense summer 
rains. 

In addition to the normally expected strong depolarization effects with 
rain attenuation, some anomalous effects not predicted by contemporary 
rain propagation theory have been observed. One effect, which has been 
reported elsewhere,! is a large change in cross-polarization discrimination 
without an attendant measurable change in copolarization signal at- 
tenuation. A graphic example of one event is shown in Fig. 9. This seg- 
ment of a pen recording shows a simultaneous time trace of the right- 
hand circular copolarized and the left-hand circular cross-polarized 
signal components. The decibel levels shown are relative to the clear air 
copolarized signal level. Significant variation in depolarization was ob- 
served in the vicinity of 1480 hours when the copolarized signal is at its 
clear air level. This effect has been observed both during summer rain 
events, often related to thunderstorms, and during the cooler months 
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with only an overcast sky, near-freezing temperature, and no precipita- 
tion. This effect is very likely caused by differential phase shift through 
ice crystals present in the propagation path. 

A second observed anomalous depolarization effect has been rapid 
discrete steps in the cross-polarized signal during thunderstorms. This 
effect, shown in the cross-polarized trace in Fig. 10, is believed due to 
rapid orientation of ice crystals in the transmission path by lightning 
discharges near the path.’ 

Both these anomalous effects have been simultaneously observed over 
similar transmission paths using the COMSTAR A and B beacon signals 
at 19 and 28 GHz. 


V. SUMMARY 


A measurement of atmospheric attenuation and depolarization at 12 
GHz over an earth-space propagation path has been described. The 
measurement system uses a two-branch, narrowband frequency tracking 
receiver in conjunction with a 6-meter-aperture, horn-reflector antenna 
to measure copolarized and cross-polarized components of the received 
cw beacon signal from the Communications Technology Satellite. 

The measurement system has been operating essentially unattended 
since April 1976, recording the atmospheric attenuation and depolar- 
ization of the satellite beacon signal. Attenuation statistics showing a 
distribution of the number of hours the path attenuation exceeded the 
clear air level have been presented for a one-year period. The distribution 
was corrected for data loss when the 12-GHz beacon signal was un- 
available during eclipse by bridging scaled statistical data from a colo- 
cated 19-GHz COMSTAR A beacon measurement. The distribution shows 
that a 12-GHz satellite communication link with an attenuation or fading 
margin of 10 dB would experience a 2'4-hour outage time in a one-year 
period. 

Depolarization effects have been observed during normal rain fade 
events and when no measurable copolarized signal attenuation was 
present. The depolarizati2n effects observed for rain attenuation greater 
than 5 dB appear to follow predictions based on differential attenuation 
and phase shift through oblate spheroidal rain drops. However, when 
no measurable or very low copolarized signal attenuation is present, the 
observed depolarization effects are very likely caused by differential 
phase shift through ice crystals. 
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Fig. 10—Anomalous cross polarization event, 11.7 GHz CTs beacon, Holmdel, N.J., April 25, 1977. 
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Fig. 11—Cumulative attenuation statistics at 12 and 19 GHz for July 29, 1976 to August 
27, 1976. 
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APPENDIX 


Frequency Scaling of Attenuation Statistics 


To obtain an estimate of the attenuation statistics at 12 GHz when 
the CTS beacon signal was lost during eclipse periods and system outage, 
an empirical relation was determined between the 12-GHz attenuation 
statistics and 19-GHz COMSTAR beacon attenuation statistics. 

The propagation paths through the atmosphere from the two satellites 
and the signal polarizations are different. The CTS beacon signal is cir- 
cularly polarized and at Holmdel requires a ground station antenna point 
at approximately 234 degrees azimuth and 27 degrees elevation. The 
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Fig. 12—Plot of 12-GHz attenuation as a function of 19-GHz attenuation at constant 
outage periods for July 29, 1976 to August 27, 1976. 


COMSTAR A beacon signal is linearly polarized and requires an antenna 
point of approximately 244 degrees azimuth and 18 degrees elevation. 
The frequency scaling presented here, therefore, only applies to these 
two experiments. 

The frequency scaling of the attenuation statistics was done by relating 
the decibel levels of the 12- and 19-GHz cumulative attenuation distri- 
bution curve at equal outage times. A common measurement period was 
chosen when both systems were operating and where sufficient rainfall 
occurred to produce significant attenuation at the two frequencies. This 
period, between July 29 and August 27, 1976, included several rain 
events, providing an ensemble average over multiple events. Figure 11 
shows the measured cumulative attenuation statistics at 12 and 19 GHz. 
Using the data from this figure, another curve was plotted showing the 
attenuation at 12 GHz as a function of attenuation at 19 GHz at constant 
outage times. This frequency relationship is shown in Fig. 12. This curve 
appears to have segments with two distinctly different slopes. The 
change in slope at lower attenuations may be due in part to digitizing 
errors of the 12-GHz data. In the region above 10 dB attenuation at 19 
GHz, the curve has a constant slope of 0.36. This shows, for a given outage 
time, the attenuation observed at 12 GHz will be 0.36 times the decibel 
attenuation at 19 GHz for the same measurement period. 
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Fig. 13—Cumulative attenuation statistics at 12 GHz with data scaled from 19 GHz 
for June 24, 1976 to July 17, 1976. 


To test the accuracy of this rule, another period of data was analyzed. 
The cumulative attenuation statistics for 12 and 19 GHz were calculated 
for the period between June 24 and July 17, 1976. The 19-GHz data were 
scaled to 12 GHz using the previously determined factor. Both data are 
plotted in Fig. 13. The two sets of data are almost indistinguishable over 
most of the distributions, which lends reasonable confidence to the 
scaling technique. 
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This paper addresses the simulation and detection of logic faults 
in CMOS integrated circuits. CMOS logic gates are intrinsically tri-state 
devices: output low, output high, or output open. This third, high- 
impedance condition introduces a new, nonclassical logic fault: the 
“stuck-open.” The paper describes the modeling of this fault and its 
complement, the stuck-on, by means of gate-level networks. In addition, 
this paper provides a methodology for creating simulator models for 
tri-state and other dynamic circuit elements. The models are gate-level 
in structure, provide for both classical and stuck-open/stuck-on faults, 
and can be adopted for use on essentially any general purpose logic 
simulator. 


l. INTRODUCTION 


The challenge of testing silicon integrated circuits (ICs) is becoming 
more formidable with the rapidly expanding production of large-scale 
integrated (LSI) circuits. Increased gate-count, increased pin-count, 
smaller feature size, higher performance, and higher complexity all 
contribute to a mounting “testability” problem. Furthermore, there is 
considerable evidence that the economic requirements to meet that 
challenge will continue to grow at a rate markedly greater than that of 
circuit size alone. 

As a further dimension to the challenge, IC tests must be specifically 
designed to recognize failure-mode dependence upon circuit configu- 
ration, processing parameters, and the overall technology (TTL, ECL, 
PMOS, CMOS, etc.). That is, a Boolean network realized in one technology 
can have a strikingly different implementation in another. Consequently, 
logic tests must be created which exercise not only the gross functional 
behavior of the IC but also the structure used for that function. However, 
for large-scale ICs, internal circuit structure and complexity are in- 
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creasing at a much more rapid rate than is the number of access termi- 
nals. 

The rising use Pe MOS technology for LSI circuits has introduced a 
number of circuit elements whose logical behavior and faults are gen- 
erally not treated by existent logic simulators.! These include, for. ex- 
ample, transmission gates, tri-state inverters, and bidirectional buses. 
Furthermore, the failure modes of such circuits and even those of ordi- 
nary combinational logic gates can introduce nonclassic logic faults. That 
is, they possess a faulted behavior for which test coverage would not be 
verified on a conventional fault simulator. 

The focus of this paper will be centered upon fault modeling and logic 
simulation of CMOS digital integrated circuits. The motivation for this 
direction is the recent emergence of CMOS as a mature technology for 
the design of densely-packed, low-power digital LSI circuits.2 Secondly, 
CMOS is intrinsically a three-state logic technology. Consequently, it 
readily lends itself both to the illustration of the new, nonclassical logic 
faults and to a methodology for modeling the dynamic nature of MOS 
circuits. 


ll. CMOS LOGIC FAULTS 
2.1 CMOS logic gates 


Figure 1 shows a two-input CMOS NOR gate: the output is high if and 
only.if A = B = 0. The realization of the NOR function shows the 
series/parallel complementary nature of CMOS logic gates: F = A-B 
and F = A + B where the Boolean function A - B connects the output to 
the “1” level and the function A + B connects the output to the “0” level. 
Each is the complement of the other and is implemented, respectively, 
with p-channel FETs and n-channel FETs. 

The NOR circuit is a specific example of the general CMOS character- 
istic of complementary pull-up/pull-down networks. The only two 
steady-state logic outputs are 0 and 1. The former arises when the 
pull-down network is conducting and the pull-up network is noncon- 
ducting. The latter, F = 1, occurs when the two networks reverse their 
conductivity states. Consequently, there is no static current path be- 
tween VDD and VSS, and CMOS ICs therefore dissipate power only to 
charge and discharge circuit capacitance. 

On the other hand, there are two common situations which can lead 
to a third logic state. This third condition is the “open” or high- 
impedance state.? One source of the “open” state is the presence of a logic 
fault which prevents one network from conducting when the other is in 
a nonconductive state. A second cause is the legitimate use of a high- 
impedance state in dynamic circuits or tri-state buffers, for example. 
In each instance the output retains the logic value of the previous output 
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Fig. 1—The CMOS two-input NOR gate: (a) logic symbol and transfer function, and 
(b) FET realization. 


state. This is true because the gates are loaded with capacitance only. 
The length of time the state is retained, however, is determined by the 
leakage current at the node. 

Conceptually, a fourth state could exist: both networks conducting. 
However, this represents a logical inconsistency, i.e., the output cannot 
simultaneously be both high and low. Such a fault is more analog in na- 
ture because the output voltage lies somewhere between VDD and VSS. 
The actual value is determined by the impedance ratios of the networks 
and the associated fault. For the most part these failures will not be 
treated as logic faults. 

There are two kinds of classical logic faults: stuck-at-one (SA1) and 
stuck-at-zero (SAO). These faults may occur at either an input or an 
output of a logic gate. On the other hand, a gate with n inputs can have 
only n + 2 distinct classical faults. Input faults must be stuck in the 
“nondominant” state to be distinguishable from an output stuck-at fault. 
For AND/NAND gates and OR/NOR gates such nondominant faults are 
SAl’s and SA0’s, respectively. These faults are sometimes called 
“input-open-from” faults and will be denoted as IOF faults in this paper. 
The two-input NOR gate of Fig. 1 has the four classical faults: SAO, SA1, 
IOFA, and IOFB. These are also symbolized as F(0), F(1), F(A), and F(B), 
respectively. 

For CMOS logic gates the nonclassical “‘stuck-open” faults must be 
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Table | — CMOS two-input NOR gate: truth table 


A B F F F F F F F F 
(0) (1) (A) (B) (ASOP) (BSOP) (VDDSOP) 
0 oO Jj 0 1 1 1 1 1 4 
0 1 0 0 1 0 1 0 4 0 
1 0 0 0 1 1 0 4 0 0 
1 1 QO 0 1 0 0 0 0 0 
normal classical faults nonclassical faults 


4 = previous output state 
F(A) = F(IOFA) 
F(B) = F(UOoFsB) 


included to represent the undesired, high-impedance state caused by 
a faulty pull-up or pull-down network. For the two-input NOR gate of 
Fig. 1, there are three such stuck-open (S-OP) faults: ASOP, BSOP, and 
VDDSOP. The first, ASOP, is caused by an open, or missing, n-channel 
A-input pull-down FET. The second, BSOP, is caused by an open, or 
missing, B-input pull-down FET. The third, VDDSOP, is caused by an 
open anywhere in the series, p-channel pull-up connection to VDD. 

Table I shows the truth table for the two-input CMOS NOR gate for 
both the fault-free and the seven faulted conditions. For example, the 
fault-free gate obeys 





F=(A+B) 
whereas 
F(1oFB) = A 
and 
F(asop) = A-B+4-A-B, 


where ‘‘4” denotes the previous state of F. [Using the notation of se- 
quential circuit design, the latter equation would read F,41(ASOP) = 
A-B+F,-A~-B. The use of a “4” to symbolize F,, is a convention 
adopted to describe the effect of S-OP faults. | 

How are the seven NOR gate logic faults related to actual physical flaws 
in the IC? The SAO, SA1 faults correspond to a low-impedance “short” 
to VSS or VDD, respectively. The IOF faults are caused by an open input 
to the logic gate as a whole. In addition to being open, the input is in a 
charged condition which is recognized as a logic 0. (An JOF fault in a NOR 
gate is an SAO, by definition; in a NAND gate the analogous fault would 
be an SAI, of course.) That is, both the p-channel and the n-channel FETs 
have a 0 applied to them. On the other hand, the nonclassical S-OP faults 
arise from a missing connection to the gate of individual FETs, for ex- 
ample, with the gate in a charge state such that the FET is nonconducting. 
Another cause of an S-OP fault is an open, or missing, connection to either 
the source or the drain of an FET. 
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Table !l — CMOS two-input NOR gate: fault detection 
(test sequence: AB = 00,01,00,10) 


A B F F F F F F F F 
(0) (1) (A) (B) (ASOP) (BSOP) (VDDSOP) 
1 0 OO 1 0* 1 1 1 1 1 3 
2 0 1 O 0 og 0 I 0 1* 0 
3 0 0 41 0 1 1 1 1 1 o* 
4 1 0 O 1* 


0 1 1* 0 0 0 
normal classical faults nonclassical faults 


3 = unknown output state (0 or 1). 
* Vector at which simulator detects the fault. 


In this context an “open” denotes an undesired high impedance at 
either the gate, the source, or the drain of an FET. Of course, any residual 
capacitive or resistive coupling must be negligibly small for a high- 
impedance fault to be regarded as a true “open.” In addition, the actual 
occurrence of such faults depends on the specific topology of the logic 
gate. 

The truth table of Table I shows that S-OP faults create sequential 
circuits where only a combinational circuit existed for the fault-free gate. 
This increases the difficulty of both testing the circuit and designing a 
set of input test “vectors” to achieve a high percentage of fault coverage. 
For example, the four input states of Table I, if applied in that order, will 
detect only 5 out of the 7 logic faults. ASOP and VDDSOP will be unde- 
tected. The ASOP fault is not detected because F'(10) = “4” = 0 which 
is the correct output for the “10” input vector. The VDDSOP fault will 
not necessarily be detected because of the chance that the gate powers 
up with the output in the high (but correct) state for the “00” input 
vector. 

The primary reason that the above two faults were undetected is that 
the corresponding circuit paths (devices) were not tested to determine 
whether they fulfilled their most basic function. For example, to test the 
A-input n-channel pull-down FET, the output node must first be driven 
high and then that FET, and it alone, must be capable of pulling the node 
low. The sequence of inputs in Table I did not meet that condition. 

A set of vectors which detects all 7 faults is the following: 00,01,00,10. 
Table II shows the response of the 8 circuits (the good circuit and the 
7 faulty ones) to the above input sequence. The * symbol designates the 
vector at which each fault is first detected by the simulator. The “3” 
denotes an unknown state (either a 1 or a 0), caused by the VDDSOP fault 
as described previously. Depending upon the actual state of the circuit 
at power-up, the VDDSOP fault may, therefore, be detected at either 
vector 1 or 3. 

The influence of CMOS faults on fault coverage is shown in Fig. 2. The 
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Fig. 2—The CMOS two-input NOR gate: probability of fault detection (for a simulator) 
versus the length N of a sequence of random input vectors. 


probability of detection for random vector sequences is given for each 
of the 7 faults in Table I. The probability is that which would prevail for 
a logic simulation of a two-input NOR gate subjected to random vector 
sequences of length N. The output SA1 fault (F(1)) has a probability of 
detection of nearly unity for sequences of length greater than three. The 
two pull-down stuck-open faults have the lowest probability, reaching 
only 0.29 for sequences 6 vectors long (45 vectors are required to reach 
0.95). Note also the similarity between the VDDSOP fault and the output 
SAO fault (F(0)). Further, the lag for all three CMOS faults is evident. 
In addition to increasing the number and complexity of CMOS logic 
faults, stuck-open faults are also timing-sensitive. Specifically, the above 
set of input vectors will detect three S-OP faults of the NOR gate only if 
they are applied to the gate at a rate more rapid than that associated with 
leakage current time-constants. A rate significantly slower than, say, 
10 kHz may allow some faulty devices to charge to the correct state before 
the output is sampled by the test set. Truth-table testing at quasi-dc 
rates is inadequate. Conversely, an ill-chosen vector set, such as the bi- 
nary sequence of Table I, may not detect S-OP faults no matter how fast 
it is applied to the device under test. Note also that the sequence of Table 
I is “exhaustive,” but does not achieve 100 percent fault coverage. 
Another aspect of stuck-open faults is that of long-term reliability. 
A fault caused by a missing connection to the gate of an individual FET 
may cause that FET to be open and yet remain undetected during pro- 
duction testing if the test vector set has less than 100 percent fault cov- 
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erage. Later, however, under actual operating conditions the FET gate 
may acquire charge of the opposite polarity and become conducting or 
“stuck-on.” CMOS S-OP field failures have been observed.* 


2.2 Modeling CMOS logic faults 


The design of a set of vectors to achieve 100 percent fault coverage for 
small-scale integrated (SSI) circuits such as NAND and NOR gates is trivial 
and can be done by inspection. However, for complex high gate-count 
circuits such as medium- and large-scale ICs (MSI, LSI), the use of 
computer-based logic simulators is a necessity. To meet this need the 
simulation of both stuck-open faults and dynamic logic elements has 
been approached from the standpoint of circuit modeling. The models 
represent circuit elements both in their fault-free condition and in 
high-impedance state(s), if any. Combinational “static” gates (e.g., NAND 
or NOR) enter the high-impedance condition only in the presence of S-OP 
faults. Dynamic or tri-state logic elements, however, can intentionally 
be placed into a high-impedance state by auxiliary, or control, inputs. 

The models presented in this paper are gate-level oriented because 
no other method generally exists for simulating nonclassical logic faults. 
On the other hand, the logical behavior of the models can be incorporated 
into a higher-level functional description if that capability is available 
on the logic simulator. On the LAMP system! the Function Description 
Language (FDL) is being modified to include such “internal faults.” ® 
Although the models in this paper are implemented in terms of NAND/ 
NOR logic, they do have the advantage of being simulator independent. 
That is, they will correctly model fault-free and faulted logic networks 
regardless of the particular simulator chosen. The specific illustrations, 
however, are taken from the author’s experience with LAMP. 

Existent machine aids simulate for the most part only the classical 
SA0/SA1 faults and not the “stuck-open” faults. One possible solution 
to the problem is to use a network of conventional gates (NOT, NAND, 
NOR, etc.) to form a model which duplicates both the normal and the 
faulted behavior of a single CMOS gate. One of the basic properties of the 
model is that it must possess the capability of passing 0/1 data from input 
to output in accordance with the fault-free logic function of the gate. 
Second, when there is an S-OP fault, it must retain the previous state in 
the presence of the “provoking” input (see Table I). This suggests the » 
use of gated latch. 

The general approach to modeling either stuck-open (S-OP) faults or 
dynamic gates is shown in Fig. 3. The “gated latch” represents the nodal 
capacitance associated with the logic function. For T = 1, the output 
equals the input (Z = D); the T = 0, the output latches and stores the 
previous state (Z = “4’’). The “node faults gate”’ has been added to in- 
troduce the two classical SAO and SA1 (stuck-at) faults. The gate is not 
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Fig. 3—Block diagram: the general approach for stuck-open/dynamic gate models. 


necessary if those faults are incorporated within the “logic function” 
network. The latter represents the actual logical operation being modeled 
(NAND, NOR, etc.). The “control and fault network” establishes whether 
the output of the “logic function” is to be transmitted to the gates driven 
by the “output.” In the case of static gates, there are no “control inputs” 
to that network. Only the presence of S-OP faults would cause T = 0 (i.e., 
no transmission). Dynamic circuit elements have “control inputs” which 
can cause T' = 0 in the absence of faults. Classical faults generated in the 
“logic function” network at input D will, of course, propagate through 
the latch (and not through the S-oP fault generating network). 


ill. SPECIFIC MODELS 
3.1 The NOR gate 


The model for the two-input CMOS NOR gate is shown in Fig. 4. The 
NOR gate named GATE has zero delay but is faulted (i.e., the fault sim- 
ulator will simulate faults for this gate) so as to introduce all the classical 
faults of a two-input NOR gate. Of course, GATE represents the modeled 
logic function. All other gates in Fig. 4 compose the fault-generating 
network which sets T = 0 in the presence of an S-OP fault and the 
“provoking” input for that fault. 

The model functions as follows: If there are no faults in the circuit, 
then F = (A + B). If there are only classical faults (from GATE), then 
T = 1 and those faults propagate through GL. If there is an S-OP fault 
in the gate, than T = 0 for the provoking vector and F = 4 as required 
from Table I. For example, because its input is grounded the only fault 
that the simulator assigns to the gate ASOP is an SA1. Therefore, in the 
presence of the fault ASOP(1) and when AB = 10, then T = 0 and the GL 
latch holds the output equal to the previous value. Similarly the output 
F will be stuck-open in the presence of the VDDSOP(1) fault if and only 
if AB = 00. 
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Fig. 4—The CMOS two-input NOR gate model, NR2. The block labeled GL is the gated 
latch circuit of Fig. 3. Only those gates marked * are faulted by the simulator. Propagation 
delays are denoted by either 0 or 1. 


The NOR gate model of Fig. 4 has been designed to yield the correct 
logic behavior of the two-input CMOS NOR gate in the absence of faults. 
Secondly, it correctly models all 7 logic faults: SAO, SA1, IOFA, 
IOFB, ASOP, BSOP, and VDDSOP. Thirdly, it generates no spurious faults 
associated with the modeling, nor does it have a propagation delay other 
than the one unit that would be expected from a single NOR gate. Finally, 
the nature of the fault-generating network prevents the spurious prop- 
agation of faults from other gates connected to the inputs of the gate in 
question. 

The names of the gates in the model have been chosen to provide ease 
of use during simulation. Specifically, if a circuit contained a two-input 
CMOS NOR gate with the name NOR17, then the LAMP simulator would 
assign the following fault list to it: 


NOR17.GATE(0) 
NORL7.GATE(1) 
NOR17.GATE(A) 
NOR17.GATE(B) 
NOR17.ASOP(1) 
NOR17.BSOP(1) 
NOR17.VDDSOP(1) 
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4 = PREVIOUS STATE OF F 


Fig. 5—The CMOS transmission gate: (a) logic symbol, (b) FET realization, and (c) the 
static, unilateral logic transfer function. 


Consequently, the test engineer can determine at a glance the nature of 
a fault and where it is located. In addition, in the LAMP system all S-OP 
faults can be globally nonfaulted (or faulted) because they arise from 
gates ending in the string “SOP.” This nonfaulting capability is useful 
in determinations of relative fault coverage. 


3.2 The transmission gate 


MOS technology possesses an interesting circuit element with both 
digital and analog capability: the transmission gate. The CMOS trans- 
mission gate is shown in Fig. 5.° It is a bilateral device with a conducting 
mode for PN = 01 and a nonconducting state for PN = 10. A problem 
occurs for the two other vectors PN = 00 and PN = 11. In each case either 
the p-channel or the n-channel FET will be on, but not both. Low-to-high 
transitions-(open p-FET) or high-to-low transitions (open n-FET) will 
be attenuated as they pass through the transmission gate. In addition, 
there will be an accompanying speed degradation caused by the higher 
impedance of the single FET path. 

Although the two input vectors 00 and 11 apply abnormal conditions 
to the transmission gate, the choice of whether to regard the gate as a 
whole as either “on” or “off” is somewhat arbitrary. For these particular 
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Fig. 6—The CMOS two-channel transmission gate node, XG2: (a) logic symbol and (b) 
a gate-level model which exhibits both the fault-free and the faulted behavior of the XG2 
a Signal flow is unilateral: from A and B to F. Normally only one channel is enabled 
at one time. 


models the worst-case logic behavior of the gate has been taken to be 
F=A-P-N+4-(P+N) 


where “4” represents the previous state of the output. The above ex- 
pression can be rewritten as 


F=D-T+4-T 


where T'= P-N and D = A. This is just the equation for the gated latch 
of Fig. 3, i.e., the gated latch is equivalent to a nonfaulted transmission 
gate. . 

Functionally, transmission gates generally occur in groups of two, 
three, four, etc. Hence, they can be regarded as multiplexing nodes at 
which usually only one of the gates is conducting at a time. The model 
for the CMOS two-channel transmission gate node, used in the above 
sense, is shown in Fig. 6. The gate named SUM is a zero-delay nonfaulted 
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g. 7—The CMOS tri-state inverter: (a) the FET realization, (b) the truth table, and 
ey ee logic symbol. 


tied-OR node (TOR). The purpose of the gate labeled NODE is to provide 
the two classical faults SAO, SA1 associated with the node F. The non- 
classical faults generated by this network are four in number: ASOP(0), 
ASOP(1), BSOP(0), and BSOP(1). As usual, ASOP(1) means that the A- 
channel is stuck-open and, conversely, ASOP(0) means that it is 
“stuck-on” (S-ON). For the latter fault, F = A + B- (PB) - (NB). Here, 
the stuck-on has been treated as a legitimate logic fault whose presence 
induces a “‘1-dominant” short, i.e., the spurious A-input is ORed with 
the correct B-channel response. Of course, there may be no technological 
reason to assign a S-ON fault as either 1-dominant or 0-dominant. In that 
case, the model in Fig. 6 (and others) can easily be recast to exhibit only 
S-OP nonclassical faults. 

Stuck-open faults in a CMOS transmission gate are also timing- 
sensitive as in NAND and NOR gates, but in a different manner. Specifi- 
cally, even with one FET of the pair open, the other can provide nearly 
a complete logic transition but at a higher average impedance level. 
Consequently, 0/1 data will propagate through the gate but at slightly 
slower speeds. However, the reduction in speed is much smaller than that 
caused by S-OP faults in NAND or NOR gates. Therefore, an S-OP fault 
in a transmission gate may be intrinsically undetectable even at the 
highest data rates that occur at a particular gate. 
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_ Fig. 8—The CMOS tri-state inverter model. Only the gate marked * is faulted by the 
simulator. Propagation delays are denoted by either 0 or 1. 


3.3 The tri-state inverter 


The FET implementation of the CMOS tri-state inverter is shown in 
Fig. 7(a). In this case P and N are control leads that determine whether 
the circuit inverts the input A or remains in the high-impedance (‘‘4’’) 
state. For the latter condition, the nodal capacitance retains the value 
of the previous state (“0,” “1,” or “3’’). The truth table is given in Fig. 
7(b) and leads to the transfer function: F = A-P + 4-(A-P + A-N). The 
logic symbol is shown in Fig. 7(c). 

The model for the tri-state inverter is given in Fig. 8. Here,-only the 
two classical faults F(0) and F(1) have been assigned to the model be- 
cause of the similarity of VDDSOP/VSSSOP faults to F(0) and F(1) faults, 
respectively. The S-OP and S-ON faults could be modeled, of course, but 
the added complexity does not warrant separate treatment. 

The relation of the model network to the truth table is perhaps more 
obvious if the transfer function is written as 


F=1-A-P+0-A-N+ 4-(A-P+ A-N). 


The terms A-P and A-N then compose the transmit (7) function: T = 
A-P + A-N. Therefore, when T = 0 then F = “4.” That is, the output 
latches (or stores) the previous state. 


3.4 Modified tri-state inverter 


The addition of an n-channel transmission gate FET to the output of 
the tri-state inverter forms a ‘“‘modified” inverter of Fig. 9(a). (This 
circuit element forms one half of a gated sense amplifier.) The resultant 
truth table is shown in Fig. 9(b). The “3” (unknown 0 or 1) state occurs 
whenever A = B and both the inverter and the transmission gate are 
enabled. In other words, whenever each attempts to drive the node F to 
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Fig. 9—The CMOS modified tri-state inverter: (a) the FET realization, (b) the truth table 
where 3 indicates an unknown 0 or 1 state and 4 symbolizes the previous state of F, and 
(c) the logic symbol for the combination tri-state inverter and n-channel FET. 


opposing logic states, the output is indeterminate (‘‘3”). The high im- 
pedance “‘4” state occurs whenever both channels are disabled. 

The model for the modified tri-state inverter can be developed by 
noting that the Boolean expression that selects the A channel is P-A + 
N-A and the term that selects the B channel is E [see Fig. 9(a)]. There- 
fore, define SA = P-A + N-A and SB = E. Then, T = SA + SB or T = 
SA-SB. Therefore, 


F=A-SA-SB+B-SA-SB+3-SA-SB+4-SA-5 
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Fig. 10—The CMOS modified tri-state inverter model. 


or : 
F=(A+SA+SB)-(B +SA + 5B) -(3+SA + 5B) -(4+SA+4+ SB). 


The model corresponding to the latter expression for F is shown in 
Fig. 10. The model has four faults assigned to it: two classical faults 
NODE(0,1), and two CMOS faults BSOP(0,1). For example, the fault 
BSOP(1) means that the B-channel transmission gate is stuck-open and 
nonconducting. S-OP faults in the inverter itself are ignored (see Section 
3.3, above). 

The model of Fig. 10 produces a “3” output whenever conflicting 
output conditions are generated by the simultaneous selection of both 
the A channel and the B channel. The “3” capability may be important 
during design verification of the fault-free circuit behavior. On the other 
hand, if it is known that mis-selection is unimportant or not possible, 
then the model can be reduced to that shown in Fig. 11. 

It would be pointless to introduce into either model a fault whose 
sole effect would be the generation of a “3” output. For fault simulations 
“3” outputs are effectively ignored. A fault whose only result is to pro- 
duce a “3” would be undetectable on a logic simulator. 


3.5 Input/output port 


Some integrated circuits contain pins which can serve as both input 
and output ports. Figure 12(a) shows the physical structure of one such 
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Fig. 11—The CMOS modified tri-state inverter model: reduced version. The unknown 
3 condition has been ignored and replaced by a 0-dominant short assignment. 





pin. IOPIN is the actual connection to the external world, FO represents 
all fan-out loads from the node, and FI represents all fan-in (tri-state) 
devices at the node. The modeled structure is shown in Fig. 12(b) where 
the IOPIN has been divided into separate IN and OUT functions. The D 
and E variables have been introduced as the control inputs which define 
the state of the input driver and the FI branch, respectively. 

The truth table of Fig. 12(c) defines the relation between D, E, and 
OUT. Note that the impedance of the input driver (of the test set) is taken 
to be much less than that of the FJ branch. Consequently, whenever the 
node is driven from an external source (D = 1), the logical state of the 
driver overrides that of FI. 

Under the above assumptions the gate-level model for the I/O port 
can be constructed as shown in Fig. 13. The model has been assigned four 
faults: the usual output SAO, SA1 faults and two S-OP/S-ON faults asso- 
ciated with the FI branch. Ideally each fan-in branch at the node would 
have two such faults. However, in the absence of a detailed knowledge 
of the fan-in network, just two faults have been indicated. 


3.6 Tri-state bilateral buses joined by a transmission gate 


The characteristics of the simple I/O port described above can be 
extended to more complex networks. One example is shown in Fig. 14. 
Two tri-state bilateral buses are connected by a bilateral transmission 
gate. All the data sources (“talkers”) are grouped on the left and all data 
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Fig. 12—Input/output port: (a) physical structure, (b) modeled structure, and (c) truth 
table. IOPIN is the actual connection to the outside world, FO represents all fan-out loads, 
and FY represents all fan-in (tri-state) devices. D and E define the state of the input driver 
and the FT branch, respectively. The symbol 4 denotes the previous state of OUT. 


sinks (“listeners’’) on the right. A device that can be both send and re- 
ceive would be represented once in each group. 
The models described previously in this paper have all been inde- 





— = ENABLE CONTROL FOR Ft Q= ZERO DELAY 
D = ENABLE CONTROL FOR IN 1= UNIT DELAY 
* = FAULTED 


Fig. 13—Input/output port model. Only gates marked * are faulted, 0 and 1 denote 
delays, and D and E are enable controls for IN and FI, respectively. FI = fan-in; FO = fan- 
out. 
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Fig. 14—Two bilateral tri-state buses connected by a transmission gate. All data sources 
(drivers) are grouped on the left. All data sinks (gate inputs) are to the right. 


pendent of each other. Therefore, they can be organized as a library of 
subnetwork building blocks. Consequently, it would be tempting to 
model the network of Fig. 14 as an interconnection of three independent 
models: one for the A-bus, one for the B-bus, and one for the bilateral 
transmission gate. However, in this instance, that is not possible. A 
successful model must incorporate features from all three to correctly 
represent the true bilateral interactions between each bus by way of the 
transmission gate. 

The model for the fault-free behavior of the A-bus output is shown 
in Fig. 15. The interbus coupling is modeled by means of the EXG-EB AND 
gate and the B-SB AND gate. For example, if the A-bus driving sources 
are in the high-impedance state (EA = 0) and the B-bus is in the low- 
impedance sourcing mode (EB = 1), then when the transmission gate is 
enabled (EXG = 1), the A-bus output takes the same value as that present 
on the B-bus. Conversely, information can travel from the A-bus to the 
B-bus. The model for the B-bus output has not been shown because it 
is analogous to that of the A-bus: In Fig. 15, wherever the symbol A ap- 
pears substitute B and vice versa. The fault-free behavior of the bus 
model as compared to that of the “actual” buses is given in Table III. 
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Fig. 15—The A-bus fault-free model. The unknown 8 condition is ignored here. The 
B-bus model is analogous. 


Table Ill — Fault-free behavior.of the coupled buses 
and their model 
Model Actual Buses 
EXG EA; EB; A B A B 
0 0 0 4 4 4 4 
0 0 1 4 DB; 4 DB; 
0 1 0 DA; 4 DA; 4 
0 1 1 DA; DB; DA; DB; 
1 0 0 4 . a 
1 0 1 DB; DB; DB; DB; 
1 1 0 DA; DA; DA; DA; 
1 1 1 DA; + DB; DA; + DB; ae * 
” A B A B 
0 0 0 0 
0 1 3 3 
1 0 3 3 
1 1 1 1 
present next 


EXG, EA;, EB; = enable control inputs 
DA;, DB; = data inputs 

A, B= bus outputs 

3 = unknown 0 or 1 
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Fig. 16—The A-bus faulted model. Only gates marked * will be faulted by the simulator. 
Two SA0/SAI node (pin) faults and m + 1 pairs of stuck-open/stuck-on faults have been 
assigned. The B-bus faulted model is analogous. 


One possible faulted model for the A-bus is shown in Fig. 16. For 
multiple selections among the DAJ and B the response is that of a 0- 
dominant short. The “3” output could be substituted, however. All el- 
ements are zero delay except the gated latch. The usual faults have been 
assigned to the model: two SA0/SAI classical “pin” faults and m + 1 pairs 
of S-OP/S-ON faults. As before, the faulted model for the B-bus output 
is obtained by the symbolic interchange of As and Bs. 


3.7 The programmable logic array (PLA) 


The programmable logic array (PLA) is a simple method for imple- 
menting “random” combinational logic networks. An example of a 
three-variable, three output PLA is pictured in Fig. 17. The circuit is 
implemented in dynamic “pseudo NMOS.” 7 That is, only the pull-up, 
or precharge, FETs are PMOS; all others are NMOS. 

Clock $1 precharges the word and bit lines to the 1 state. Next, clock 
2 causes the input signals x, y, z to propagate to the output terminals 
W1, W2, W3. (Although the two clock waveforms are essentially in 
phase, they are applied to PMOS and NMOS FETs, respectively, and the 
resulting conduction modes are 180° out of phase.) 
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Fig. 17—Dynamic pseudo NMOS programmable logic array (PLA). There are three input 


variables (x, y, z) and three outputs (W1, W2, W3). Pull-up FETs are PMOS; all others are 
NMOS.? 


The steady-state transfer functions are: 
Bl=x-y, B2=z, B3=x-y, 
W1=B2+B3, W2=B1+B3, W3= W31- W332, 
. W381 = B1 + B2, W32 = B3. 
Therefore, 
W1=Zz+%x-y, 
W2=x “y+tx-y, 
W38=X-y-ztx-Vez. 


Models for the example PLA can be constructed by noting that the 
relation between inputs and bit lines and between bit lines and word lines 
is essentially that of a dynamic NOR gate. Figure 18 shows several 
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Fig. 18—Subcircuit portions of the PLA of Fig. 17. 


“subcircuit” portions of the PLA. Figure 19(a) gives the model for one 
typical subcircuit, W2. Figure 19(b) gives the model for the more complex 
W3 output. By this method the characteristics of any PLA, including 
dynamic properties, can be represented as a sum of simpler subcircuit 
models. 


IV. THE GENERAL CASE 


The preceding examples are particular illustrations of a more general 
configuration, shown in Fig. 20. The capacitance implicitly associated 
with the node F will be in one of four possible states: ({) low-impedance 
0, (iz) low-impedance 1, (iii) high-impedance 0, or (iv) high-impedance 
1. The first two conditions occur whenever one of the four input channels 
are enabled (conducting). The latter two cases arise when no circuit is 
enabled and the output is the resultant high-impedance state (F = 4). 

In terms of the model, the logic state of the network is “transferred” 
to node F if and only if T = 1 where 


T = >~ EX; - EXSOP;. 
i 
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Fig. 19—PLA subcircuit models. Gates marked * are faulted by the simulator, and 0 and 
1 denote delays. 


That is, at least one of the input circuits must be enabled (EX) = 1) and 
that circuit must not be stuck-open, i.e., EXSOP) = 0. Otherwise, T' = 
0 and F = 4. 

If two channels, A and B, are simultaneously enabled (A&B), then 
the result is said to be a 0-dominant short if A&B = A-B. For the 
1-dominant short, A&B = A + B. These two cases are shown in Figs. 21 
and 22, respectively. Of course, if stuck-on faults are ignored entirely, 
then the network for T is represented by the summation immediately 
above. 

Logic faults are assigned in the following manner. Simple n-input 
combinational gates are given the usual n + 2 stuck-at faults. In addition, 
one stuck-open fault is associated with each parallel branch of the 
pull-up/pull-down networks. Stuck-opens in strictly series paths to either 
VDD or VSS are ignored because of their similarity to SAO/SA1 faults, 
respectively. Unless the details of circuit technology and design dictate 
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WHERE EA =CA,EB=CB1 e« CB2 
EC=CC, ED =CD1« CD2« CD3 


Fig. 20—The general case: multiple series/parallel combinations driving one node. FETs 
on the upper side are p-channel; those below are n-channel. 


a clear choice, stuck-on logic faults should generally be disregarded. 
Nodes at which several tri-state elements connect can be assigned the 
two SA0/SA1 classical faults and one S-OP fault per channel. Fault con- 
ditions producing the unknown “3” state should not be modeled. On the 
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Fig. 21—The general case: S-OP and S-ON faults (0-dominant). 
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Fig. 22—The general case: S-OP and S-ON faults (1-dominant). This is a variant of the 
model of Fig. 21 except that the S-ON fault is 1-dominant. 


other hand, simulation models for design verification (“true value” 
simulation) can include the “3” state as an indication of erroneous tri- 
state selection as in bus-oriented circuits. 


V. SUMMARY 


This paper has described a procedure for modeling faults which are 
a peculiarity of CMOS digital integrated circuits. Furthermore, the re- 
sultant methodology was also utilized to provide simulator models for 
complex MOS dynamic circuit topologies. The models are gate-level in 
structure and can be adapted for use on essentially any general purpose 
logic simulator. In addition, the models have been chosen to avoid faults 
which are artifacts of the model and which do not represent physically 
likely logic defects. The models have also been structured to preserve 
the distinction between classical and nonclassical faults. In addition, 
all models are designed to avoid races and circuit oscillations. 

From the examples in this paper, it can beseen that there are a number 
of choices that can be taken for modeling a given logical function. In 
addition, a particular function may well be reduced in gate-count from 
the examples and the “general” realizations shown in this paper. That 
is, depending upon the selection of characteristics most important to 
the user, different models will result. In that spirit, the illustrations used 
above were selected to demonstrate the principles of modeling in a clear, 
straightforward manner. 
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A theoretical expression is derived in this paper that evaluates the 
effectiveness of a set of logic tests for digital integrated circuits. The 
validity of the proposed figure of merit is examined with experimental 
data from CMOS integrated circuits. In addition, the importance of 
simulating the nonclassical stuck-open/stuck-on CMOS logic faults is 
also studied. 


1. INTRODUCTION 


The ever-growing complexity of digital integrated circuits places in- 
creasing emphasis upon the use of computerized design aids. Because 
no integrated circuit design is complete without an accompanying set 
of tests, one essential tool:is the logic simulator.! The two principal 
reasons for logic simulation are (i) to verify the logic design and (i) to 
develop the set of tests. A third purpose, related to the second, is that 
of diagnosis, i.e., identification of logic faults causing specific yield 
problems. 

This paper will address itself to a study of the relation between fault 
coverage and measured yield and will consider specifically CMOS inte- 
grated circuits. The latter choice was made for two reasons. First, CMOS 
ICs are an attractive choice for many system designs. Second, CMOS ICs 
can possess nonclassical logic faults peculiar to MOS circuit elements: 
stuck-opens and stuck-ons.? 

To verify the logical behavior of the IC, the test engineer usually begins 
with binary “vectors,” or test patterns, that test the basic input/output 
logic functions of the circuit. For example, if the IC is a multiplexer, then 
multiplexing different data patterns is a natural starting point. Designing 
a set of vectors for high fault coverage generally represents a larger 
challenge than that of design verification. The difficulty arises because 
the logical structure of the IC must be tested and not just its generic 
properties, such as multiplexing. The major disadvantage of “behavioral” 
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tests is that they are usually too lengthy.*4 Consequently, the simplest 
approach is to begin with a sequence of representative behavioral tests 
and then add to them the necessary “structural” tests to bring the fault 
coverage to the required level. In any event, the process of developing 
the digital tests should start as soon as the systems logic design is for- 
mulated and before the design reaches the mask layout phase. 


ll. FAULT COVERAGE AND MEASURED YIELD 


Perhaps one of the most important questions for test vector devel- 
opment is: How much fault coverage is enough? The answer must ob- 
viously be related to the intrinsic yield of the IC under test. For example, 
if the yield is 100 percent, then any low-coverage vector set can be used, 
including none at all. On the other hand, if the yield is low, then there 
will be many defective ICs that can potentially masquerade as “good” 
devices if the fault coverage is poor. In the latter case, the lower the fault 
coverage the more probable it will be that a chip that tests “good” con- 
tains a fault. 

In any event, the measured functional yield,* ym, is the sum of two 
components: the actual functional yield, y, and the yield of bad ICs tested 
“sood,” ybg. Thus, ym = y + ybg. This leads to asecond question: How 
is ybg related to the fault coverage f? Consider the following definitions 
for the functional yield problem: 


y = actual functional yield (good chips). 
ym = measured functional yield. 
1—y = yield of bad chips. 
yi = yield of chips with i faults (i = 1,2,3---). 
ybg = yield of bad chips that test good. 
ybg(t) = yield of bad chips, with i faults, that test good. 
f = fault coverage (0 <f <1). 
yr = field reject rate due to functional defects. 


First, assume that ybg(i) = (1 — f)'- yi. Fori = 1, this implies ybg (1) 
= (1 — f) - y1, which is quite reasonable because it represents the basic 
assumption behind most logic simulation. That is, a logic simulator 
considers only the population of all chips in which there is only one logic 
fault per chip. Second, if the fault coverage is f, then (1 — f) is the fraction 
of all faults (chips) that will be undetected by the test sequence. For i 
=n > 1, the assumption amounts to stating that the probability of n 
faults being undetected is (1 — f)”. This is true only if multiple faults are 
independent of one another. 


* The yield of chips that are free of logic faults irrespective of their analog voltage/current 
behavior. 


1476 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


The second assumption is that yi = y - (1 — y)! (i = 1,2,3,-). This is 
the geometric distribution function and has been found to correctly 
describe the distribution of defective cells in static RAM chips. In par- 
ticular, the average yield y and the average number of fault-producing 
defects per chip, xo, were measured and found to be related by the 
equation y = 1/(1 + x9). In the case of the RAM chips, xo = 2.7. For defect 
densities higher than, say, 5 per chip, a different distribution may be 
necessary. 

Under the above assumptions, 


yog= 5 fey ~y) 


id )¢ ) 
= a-fd-y 
tae esses aN 
Therefore, 
ym =. 
1—0=-f)pl—y) 


The field (or incoming inspection) reject rate yr is determined by the 
fraction of bad ICs that passed the functional test vector sequence, but 
that would have failed had the fault coverage been higher. 

Therefore, 


yr = ybg/ym, 
which gives 
yr =(1-f)—y). 


(Note that in this context undetectable faults are not included in the 
statistical base for fault coverage. The most likely negative consequences 
of undetectable faults are long-term reliability problems or intermittents, 
not failures at incoming inspection.) 

Inversely, for a given field reject rate (or “quality level’) the fault 
coverage would be 
ee 
1-y 
As an example, for an IC with a yield of 20 percent (y = 0.2), the fault 
coverage would have to be equal to or greater than 98.8 percent for a 
reject rate of 1 percent (yr = 0.01) or lower due to undetected logic 
faults. 


f=i- 


lil, FAULT COVERAGE AND LOGIC SIMULATORS 


The percent fault coverage quoted for a set of test vectors is an im- 
portant measure of their test effectiveness. Other things being equal, 
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a vector sequence with 90 percent coverage is twice as likely to identify 
faulty ICs as one with only 45 percent coverage. Equally important, 
however, is the question: Ninety percent of what? Unfortunately, the 
answer is usually 90 percent of what faults the simulator simulates. 
Therefore, in comparing different simulations and quoted fault coverage, 
it is essential to know the kinds of faults that were modeled. Many 
simulators model only classical faults (stuck-at-1 and stuck-at-0). Others 
may treat only gate output stuck-at faults. It is common in the case of 
printed circuit board simulations to consider only the pin faults of each 
IC on the board. In the experimental results of the next section, all 
classical faults were modeled in addition to the relevant CMOS stuck- 
open and stuck-on faults.? 

Second, to lower costs, simulations generally use only a “collapsed” 
set of faults. That is, several faults on different gates may cause the same 
faulted circuit behavior as viewed from the primary circuit outputs. 
Consequently, they are included in a single fault equivalence class with 
only one fault in that class being simulated. As an example, a chain of 
three inverters would have six physically distinct classical faults. How- 
ever, after fault collapsing, only two faults would be simulated. The 
obvious drawback to fault collapsing is that it distorts the relation be- 
tween the predicted and the observed number of failures. 

An additional factor can alter the ratio of predicted to observed fail- 
ures: the probability of and relation between physical faults and simu- 
lator faults. First, not all physical faults are equally probable. Second, 
an individual physical fault does not necessarily produce a single logical 
fault, i.e., one fault may map into two or more simulator faults or vice 
versa. In addition, the distribution function for physical defects produces 
many more chips with multiple faults than chips with only one fault. 
Obviously, this could pose a problem in the interpretation of failure data 
because fault simulators simulate only singly faulted circuits, not those 
with multiple faults. The effect of gross physical faults and the pre- 
ponderance of chips with multiple faults is to cause a higher number of 
failures during the initial part of the test sequence compared to that 
predicted by the simulator. 

The simulator can also contribute its own distortions. For example, 
it is clear that undetected faults that caused the simulation to oscillate 
may well cause an actual integrated circuit to fail during testing. In the 
same fashion, the simulator can be overly pessimistic with respect to 
other faults that produce or leave unknown states in the circuit (e.g., 
set/reset inputs to flip-flops). Conversely, the simulator may treat a 
particular fault as having been detected on a specific vector, but the fault 
causes the IC to fail on a following vector. This can occur because of 
differences between the discrete delays of the simulator and the actual 
delays present in the IC. 
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Table | — Circuit characteristics 


Circuit Inputs Outputs Gates 
D flip-flop 4 2 15 
Multivibrator 8* 6t 47 
MUX 14 7 238 


* Includes one as I/O and another for I/O control. 
t Includes one as I/O. 


The above are some of the more evident reasons why fault simulator 
results are only approximations to those actually measured on integrated 
circuits. This is true even if the simulator modeled all reasonable types 
of logic faults. 


IV. EXPERIMENTAL RESULTS 


Three CMOS integrated circuits were selected for studying the relation 
between fault coverage and yield. Two of these circuits were studied in 
some detail. The two circuits are (i) a dual D flip-flop functionally 
equivalent to the RCA CD4013A and (ii) a monostable/astable retrig- 
gerable multivibrator similar to the RCA CD4047A. 

Table I gives the circuit characteristics of the circuits. The column 
labeled “gates” gives the gate count for each circuit with the convention 
that a node to which two or more transmission gates connect is counted 
as one logic gate. Table II summarizes the fault characteristics of each 
circuit. The circuits were modeled to include the nonclassical stuck- 
open/stuck-on CMOS faults. 

Although CMOS faults double the number of total faults, it is not ob- 
vious whether they should be counted on a 1:1 basis with classical faults. 
If the probability of occurrence of stuck-open/stuck-on faults is markedly 
different than that of SAO, SA1 faults, then a weighting factor different 
than unity should be used to determine the total number of “effective”’ 
faults. As a second consideration, classical faults are collapsible into 
equivalence classes, but the CMOS nonclassical faults are individualized 
to single gates. 


Table Il — Fault characteristics 
(1) (2) (3) (4) (5) 
Physical Faults* Total Faults 
Circuit Gates Classical CMOS Total per Gate 
D-FF 15 38 38 76 5.1 
MULTI 47 133 134 267 5.7 
MUX 238 902 536 1438 6.3 


* After fault collapsing. 
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Table Ill — Fault coverage results 


(1) (2) (3) (4) (5) (6) (7) 
No. of Faults No. of — Vectors Fault Coverage* 
Circuit Gates perGate Vectors perGate Classical CMOS Total 


D-FF 15 5.1 15 1.0 100% 100% 100% 
MULTI 47 5.7 119 2.5 95 84 89 
MUX 238 6.3 5049 23 95 90? 93? 


* All faults, including undetectables. 


4.1 The D flip-flop circuit 


Not surprisingly, the D flip-flop circuit is 100 percent testable for both 
classical and CMOS logic faults (see Table III). Figure 1 shows the cu- 
mulative total fault coverage versus the fraction of the test vector se- 
quence applied to the Ic. In this case, 15 vectors were used to reach 100 
percent coverage. Strictly speaking, the points in Fig. 1 should have been 
connected by step functions that rise to meet each datum point. For the 
sake of clarity, however, straight line segments running directly from 
one point to the other were used. 

Figure 2 shows the relation between the total fault coverge f(total) and 
the classical fault coverage f(class). The total fault coverage falls below 
that for classical faults because of a characteristic lag in CMOS fault 


D FLIP—FLOP 


FAULT COVERAGE, f(TOTAL) IN PERCENT 





0 
0 10 20 30 40 50 60 70 80 90 100 
VECTORS IN PERCENT 


mi . 1—The D flip-flop: total fault coverage vs. normalized vector number (15 vectors 
total). 
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Fig. 2—The D flip-flop: total fault coverage and CMOS fault coverage as functions of 
the classical fault coverage for the vector sequence of Fig. 1. 


coverage. The CMOS lag is more clearly shown by the second curve, 
marked f(CMOsS), in Fig. 2. The lag is caused by the “history-dependence” 
of CMOS stuck-open faults that require at least two different vectors for 
detection. 

Figure 3 shows a comparison between the simulation data and actual 
measurements for 11,150 ICs from 28 wafers. It is important to note that 
the curves of Fig. 3 are reverse cumulative distribution functions.® The 
independent variable is the vector number. Fifteen vectors were used 
to test this circuit. Vector 1 was the first in the sequence and is shown 
at the origin of the graph. The dependent variable is the running sum 
of the number of detected faults (or chip failures) beginning with vector 
15 and proceeding to vector 1. Hence, the curves indicate the likelihood 
that a defective chip will fail at a specific vector. 

In the simulation curve, the cumulative number of detected faults is 
shown. For the wafer data the cumulative number of functional, or logic 
test, failures is plotted. Reverse cdf’s were chosen because they reveal 
the structure of the tail regions where the fault coverage is not changing 
as rapidly as it is near the beginning of the vector sequence. Of course, 
the tail of each curve illuminates most clearly the effects of ICs with single 
faults. 

The simulation data are the same as those used for Fig. 1 in which all 
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Fig. 3—The D flip-flop: reverse cumulative distribution functions for functional yield 
loss plotted as functions of the vector number. The simulation data are that of Fig. 1. The 
measured data points were obtained from 11,150 chips from 28 wafers. 


faults, both classical and CMOS, are included. There is reasonable 
agreement between the “predicted” and the measured rcdf’s. Only by 
coincidence would the two curves lie one upon the other because each 
is plotted in absolute numbers. Ideally, of course, they would be sepa- 
rated by a constant vertical displacement. The agreement is one of 
general shape. A specific quantitative comparison is treated later in this 
paper. 

Naturally, near the beginning of the sequence there are more actual 
Ic failures than indicated by the simulation. Recalling the factors dis- 
cussed in Section III above, initial failures are probably caused by gross 
shorts, opens, and multiple faults. (All chips, however, were prescreened 
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for contact failures.) The slight dip in the wafer data for vectors 7,8,9 is 
caused by a 2:1 overprediction of fault coverage. Vectors 7,8,9 detect, 
in a worst-case sense, set and reset faults which in a real IC are more likely 
to fail earlier in the vector sequence. 

Only one fault is detected by vector 12, the data input transmission 
gate stuck-on. The failure rate on that vector was 17/11150 or 0.15 per- 
cent. The only other vector that detects a single CMOS fault is vector 15. 
The fault is a stuck-open in the master flip-flop feedback transmission 
gate. The failure rate was 1/11150, or 0.009 percent, quite low compared 
to the stuck-on fault. Strangely, there are 8 ICs (0.07 percent) that failed 
at an added vector 16 where the fault coverage is already at 100 percent. 
Vector 16 forces the set and reset inputs active at the same time. Al- 
though the behavior of the fault-free circuit is deterministic, no struc- 
tural faults (classical, stuck-open, or stuck-on) remain in the circuit to 
be detected at that vector. The failures may have been caused by analog 
effects. 

The predicted field reject rate yr, as a function of fault coverage f, can 
be computed from the data of Figs. 1 and 3. The procedure begins first 
with Fig. 3 where the measured yield is obtained as a function of the 
vector at which the sequence was truncated. Next, the fault coverage for 
the truncated vector set is established by reference to Fig. 1. Of course, 
if the entire untruncated vector set is applied to the IC, the cumulative 
fault coverage at the last vector is 100 percent and the measured yield 
ym should equal the true yield y. 

The results of the above calculations are shown in Fig. 4. The theo- 
retical relation, yr = (1 — f)(1 — y), is the solid line. The two sets of data 
points represent the computed yr based, first, on all faults and, second, 
on only classical faults. The latter lie closer to the theoretical predic- 
tion. 

Several conclusions can be drawn from the results of Fig. 4. The first 
is that the equation is a good estimator of the reject rate, but is somewhat 
pessimistic at high fault coverage. Second, the difference between the 
predicted and measured yr at high values of f can be explained by a 
proportionally larger actual fault coverage used as the abscissa (in each 
case). Finally, even though CMOS stuck-open/stuck-on faults were 
identified in some of the circuits, their relative probability seems to be 
much less than that of the more general classical stuck-at faults. In other 
words, a 1-to-1 weighting factor does not appear to be warranted. 


4.2 The multivibrator circuit 


The multivibrator was not 100 percent testable. The cumulative fault 
coverage for all faults is shown in Fig. 5. The occasional abrupt jumps 
in fault coverage are caused by sequential portions of the circuit. That 
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Fig. 4—The D flip-flop: field reject rate vs. total fault coverage as determined from the 
data of Figs. 1 and 3. The theoretical expression is indicated by the solid line. 


is, several vectors are needed before a “fault effect” is generated at a gate 
and several more are necessary to propagate the fault to an output. When 
faults from that part of the circuit finally reach an output, there is a sharp 
increase in coverage. 


100 
MULTIVIBRATOR 

90 
80 
70 
60 
50 


40 


30 


FAULT COVERAGE, f (TOTAL) IN PERCENT 


20 





O 
0 10 20 30 40 50 60 70 80 90 100 
VECTORS IN PERCENT 


Bi 5—The multivibrator: total fault coverage vs. normalized vector number (119 vectors 
total). 
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Fig. 6—The multivibrator: total fault coverage and CMOS fault coverage as functions 
of the classical fault coverage for the vector sequence of Fig. 5. 


The final vector set provided a coverage of 89.1 percent, or 238 faults 
out of 267. The remaining 29 faults are all undetectable. There were two 
primary causes for the undetectable faults. The first was the presence 
of “asynchronous” circuit behavior (in the retrigger control section). The 
second was the use of two D flip-flops which had data inputs tied per- 
manently to a fixed logic value (lack of controllability). Two undetect- 
ables occurred in a NOR latch: all CMOS latches formed by cross-coupled 
NOR or NAND gates have two undetectable faults. 

Figure 6 shows the relation between the total fault coverage f(total) 
and the classical fault coverage f(class). Again the lag in CMOS fault 
coverage is evident. The total fault coverage reaches 89.1 percent. 
Classical coverage is 94.7 percent; CMOS coverage is 83.6 percent. Of the 
29 undetectable faults, 22 were CMOS and 7 were classical. 

Figure 7 compares simulation data with measurements taken from 
a single wafer. The wafer contained 418 chips which passed initial contact 
tests. Of those, 275 passed the logic tests for a gross functional yield of 
66 percent. Again, the reverse cdf’s are used to show the behavior in the 
tail regions near the end of the vector sequence. The overall agreement 
between the two curves is reasonable. 

As in the above D flip-flop example, the field reject rate yr can be 
calculated for the multivibrator circuit from the corresponding data 
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Fig. 7—The multivibrator: reverse cumulative distribution functions of the vector 
number. The simulation data are that of Fig. 5. The measured data points were obtained 
from 418 chips from one wafer. 


(Figs. 5 and 7). The resultant points are shown in Fig. 8. The solid line 
is the predicted relation yr = (1 — f’)(1 — y), where /’ is the fault coverage 
for all detectable faults. 

The data of Fig. 8 suggest the same observations as for the D flip-flop: 
The actual fault coverage appears to be higher toward the end of the 
vector sequence than that predicted by the simulator. Also, the theo- 
retical yr is best matched by the “classical faults only” data. Again, this 
indirectly implies that the relative frequency of CMOS faults is signifi- 
cantly less than that of the classical faults. In addition, for both the D 
flip-flop and the multivibrator the curves are similar above the 75 per- 
cent fault coverage point. In particular, each indicates that a coverage 
of 85 to 90 percent is needed to achieve a reject rate of 1 percent or less. 
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Fig. 8—The multivibrator: field reject rate vs. total (detectable) fault coverage as de- 
ne from the data of Figs. 5 and 7. The theoretical expression is indicated by the solid 
ine. 


On the other hand, the true yields for each are approximately equal (85 
and 77 percent, respectively). 


V. SUMMARY 


The fault coverage results of the previous section have been summa- 
rized in Table III. To compare the difficulty of generating test vectors 
for one circuit versus another, a measure of circuit complexity is needed.” 
Gate count [column (1)] is a poor measure because it reflects only silicon 
area and not the interconnections that create the actual circuit. One 
potential measure of circuit complexity that indicates at least the 
magnitude of test vector generation is the ratio of the number of vectors 
to the number of gates [column (4)]. In that sense, the multivibrator is 
2.5 times more complex than the D flip-flop, and the multiplexer (MUX) 
is 23 times more complex. Of course, a measure could be devised to in- 
corporate the number of circuit inputs and outputs. However, for the 
three circuits the most dominant effect is that of the number of test 
vectors. The reader can readily interpret from Table III the magnitude 
of the test generation and fault coverage problems for large-scale 
silicon-integrated circuits with thousands of gates. In addition, for 
modern Ic test equipment the number of circuit inputs and outputs 
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generally does not affect the functional test time (as long as the number 
is less than the test set maximum). 

Nevertheless, there are two major drawbacks to using “vectors/gate” 
as a measure. First, it is retrospective: Only after effort has been ex- 
pended to develop the test vectors does the “complexity” become known. 
The second reason is that it depends somewhat upon the method or skill 
used to generate the test vectors themselves. In particular, the test vector 
sequences used for the example circuits are certainly not unique. Nor 
is it likely that any of them is optimal in the sense of being the least 
number for the same level of coverage. The basic problem is that there 
probably isn’t any simple one-dimensional measure of circuit complexity 
that is useful for a broad spectrum of circuit types. 

The relative frequency of CMOS stuck-open/stuck-on faults appears 
to be significantly less than that of the classical stuck-at-0/stuck-at-1 
faults. On the other hand, CMOS nonclassical faults do occur. Perhaps 
the best approach to resolving this quandary would be a study of many 
different CMOS ICs. The investigation would use vector sets of high di- 
agnostic capability to determine which kinds of logic faults are impor- 
tant. 

Finally, the data presented in this paper support the reject rate 
(quality level) concept as an answer to the question, “How much fault 
coverage is enough?” However, the total economic picture obviously 
must take into account the cost of developing the vectors and the cost 
of using (applying) them. Only when all three of the above factors are 
considered can the cost of integrated circuit testing be properly 
judged. 
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Jitter Comparison of Tones Generated by 
Squaring and by Fourth-Power Circuits 


By J. E. MAZO 
(Manuscript received October 28, 1977) 


The tone-to-jitter power ratio is calculated for some conventional 
methods of generating a tone at the pulse repetition frequency of a PAM 
data signal by operating on the latter by an appropriate nonlinearity. 
Attention is focused on this ratio for a fourth-order nonlinearity, which 
will produce a tone even in the absence of excess bandwidth, and on this 
ratio for a square-law nonlinearity for small excess bandwidth. If the 
excess bandwidth is less than about 50 percent, the fourth power is 
superior. In particular, it yields a 10-dB improvement for 12 percent 
roll-off and binary data. 


l. INTRODUCTION 
Successful detection of the symbols in a pulse-modulated waveform 
requires a knowledge of the pulse repetition period T. Specifically, if the 
signal s(t) is of the form 
s(t)= YO ang(t — nT) (1) 


aoe 
where a, are independent equiprobable binary symbols having values 
+1, knowledge of T is required for proper sampling of s(t) to recover the 
a,. Due to small differences in transmitter and receiver oscillators, a 
priori information concerning T' is not usually sufficient, and constant 
updating of the precise current value of T' is required. Often one prefers 
to.deduce such information directly from eq. (1), rather than directly 
transmitting a tone at frequency 1/T Hz. 

A very popular method is to pass s(t) through an appropriate nonlinear 
circuit (e.g., a square-law) so that a tone is generated at frequency 1/T.! 
For example, using the square-law operation we have the following 
identity: 

© 2 fo) 
s(t) = (X angtt—nT)) = Yee —AT) 


n=—-o@ 


$F aname(t —nT)g(t -— mT) (2) 
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where in writing (2) we have used a2 = 1.* The essential features of (2) 
are made clear on noting the Poisson sum formula! 


f(t -—nT) = 1 y e2nimt/TF (= m) (3) 
n=—e T im T 
Thus the first term of the last member of (2) consists of a series of sine 
and cosine terms given by the right member of (3), where in (3) f(¢) is 
replaced by g?(t). In our particular case, we are especially interested in 
the terms corresponding to frequency w = 22/T rad/sec, i.e., the 


terms 
1 : Qt : Qa 
peek 2xi/TF =) + —2xi/TR @ a | 4 
ak & e = (4) 


where, explicitly 
F(w) = ff * g2(t)e-ietde, (5) 


In the hardware, these terms are isolated by a very narrow postfilter (of 
bandwidth B Hz, say) approximately centered about this frequency. 

In order that F[+ 27/T] # 0, “excess bandwidth” is required for the 
pulse g(t), that is, its frequency spectrum must extend beyond the Ny- 
quist frequency w = z/T. The percent of excess is usually referred to as 
the “rolloff.” 

Once g(t) is given, the tone power is easily evaluated using (4) and (5). 
Thus, for simplicity, assume the g(t) is an ideal Nyquist pulse having 
the real Fourier transform G(w) shown in Fig. 1, where percent of rolloff 
equals 100 X a. 

Using the general formula 


Fie > f DC eaaad! (6) 
TT —a 
we note how F(2z/T) depends on G(w) only in the rolloff region 
para) < |o| Sp (l +a). 


For the special case of G(w) given in Fig. 1 we calculate 


F(a) =", (7) 


t For a multilevel situation we would replace a2 = 1 with E(a,,)2, where E denotes sta- 
tistical expectation. 
n (3 


Figs = f ” exp[—iwt (tat 


is the Fourier transform of f(t). 
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Fig. 1—Fourier transform G(w) of the Nyquist pulse g(t) used for the calculations. 


allowing (4) to be rewritten as 


a 2at 
= cos —— 8 
. cos 7 (8) 
Thus from (8), 
2 
square-law tone power = a : (9) 


Tone power is not a sufficient measure of performance, of course. This 
power must be compared to the power in the background noise. One 
component of this background would be additive noise on the channel. 
This is usually negligible, however, and the background to which we refer 
is self-generated. Mathematically it is given by the last term in (2). This 
added background will cause the zero-crossings of (8) to be perturbed, 
or “jitter,” about their nominal values. Computing the tone power/ 
background power ratio for various situations is the purpose of our work.* 
In addition to the state of affairs just described, two other proposals are 
of considerable practical interest. These may be conveniently and de- 
scriptively described as 

(i) Prefiltering. 

(it) Fourth power law. 

To motivate the first, recall that the tone power is determined by the 
“overlap” of the excess bandwidth regions in the integral (6). This power 
will not be changed if we filter out the remaining central portion of the 
pulse (the region |w| < (/ T)(1—a)i in Fig. 1) before we do the squaring 
operation. The elimination of this portion of G(w) does decrease the total 
power in the background term of (2). Will it improve the tone-to-jitter 
ratio in the neighborhood of w = 27/T as well? 


t Since the tone will be “picked-off” out of the background by a narrow-band filter or 
ee loop, only the value of the background power spectrum at w = 2zx/T will be 
needed. 
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The motivation of the second situation also begins by mentioning the 
overlap contribution to the integral in (6). We note that the overlap, and 
hence the tone power (9) vanish as a vanishes. Thus for a squaring circuit 
no tone is produced if there is no excess bandwidth. We shall see later 
that this is not true for all nonlinearities, and in fact a fourth-power law 
will produce a strong tone even when a = 0. Again, what about the jitter? 
Our calculation of the latter for the fourth power is a completely new 
result. To appreciate the difficulties involved here note that the time 
averaged autocorrelation function R(7) for the output of the fourth- 
power law is given by 


R(r) =E 7 f, stone — r)dt (10) 


with s(t) given by (1). A straightforward evaluation of the matematical 
expectation in (10) would involve dealing with the eighth-order terms 


Ean On Gn3°** Ang: (11) 


The bookkeeping involved with (11) would be unmanageable. One 
novelty of our method is the introduction of a technique from the algebra 
of symmetric polynomials? for skirting the direct evaluation implied by 
(11). 

Since we are mainly interested in knowing if prefiltering or the 
fourth-power law can produce large improvements in tone-to-jitter ratio, 
our explicit evaluations will be based on simple pulse shapes. For pre- 
filtering, overlap is important and we stick to the pulse shape with fre- 
quency characteristic given in Fig. 1. For fourth-order we assume that 
small excess bandwidth produces only higher-order corrections to the 
effect present when a = 0. Consequently in this case we use only the a 
= 0 pulse. 

We shall show that prefiltering offers no improvement at all. With or 
without prefiltering the output tone-to-jitter ratio is about 12 dB if an 
output filter 10 Hz wide is used or 22 dB for one 1 Hz wide. This assumes 
a 12 percent excess bandwidth as in the Bell System 209 data set. 
Numbers for the fourth-power law are 10 dB better than this, which is 
a significant improvement. 

Before proceeding, a final comment is in order. This concerns a recent 
publication of Franks and Bubrowski? concerning prefiltering and the 
square-law nonlinearity. Their claim is that if prefiltering has a sym- 
metrical result about z/T and the post-filter is symmetrical about 27/T, 
there will be no jitter about the zero-crossings of (8). This is true, but if 
T were known exactly so that the required symmetrization could be done 
exactly, then there would be no need to measure 7’. If, however, we 
symmetrize about a T’ # T, then the background, using a standard 
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representation of passband signals, would have the form 


2 

y(t) cos 7 t (12) 
with no quadrature component relative to 27/T”’. If T’ = T we see why 
the zeros of (8) are unchanged. If T’ # T the quadrature component of 
(12) relative to 2z/T will come in, with a strength independent of how 
small T’ — T is. Only the beat frequency depends on TJ” — T.. Thus the 
Franks-Bubrowski result might be termed unstable and not applica- 
ble. 


ll. BACKGROUND SPECTRUM FOR SQUARING CIRCUIT 


In this section we compute S,(27/T), the value of the spectrum S,(w) 
of the jitter term which appears at the output of the squaring circuit at 
angle frequency w = 27/T.t We do this for the special pulse of Fig. 1 with 
and without prefiltering. In terms of this quantity the tone-jitter ratio 
will be, for a final filter of bandwidth B, using (9), 


tone power a? Qa 
ee ca B. 13 
background power 8 / z ve) 


The quantity S.(w) is the Fourier transform of the autocorrelation 
function 


R(r) = f, Fl%E)s% — 7)|dt (14) 
—_ 1 
S (ay f " e-fe"R(a)dr nae (15) 


where in (14) s2(¢) is given by (2). Denoting g(t — nT) by g, and g(t — 
t — nT) by hy, the first item to evaluate is 


Es*(t)s*(t — 7) = El(Zan8n)*(Zanhn)I, (16) 


the expectation being taken over the i.i.d. binary variables a,, having 
values +1 with equal probability. The expectation in the right side of 
(16) can be done directly, using 


Eat gQ;d, = SpqSrs + Sprdgs + Spsdqr ~ 25pq5pr5ps (17) 
to yield 
E[(Zan8n)?(Zanhn)?] = (DEn)(Dhh) + 2(VEnhn)? — Wenha (18) 


Looking ahead to the fourth-order case when we will need the average 
of eight order terms, we will not be able to write the analog of (17) in any 


t The subscript c on S,(w) emphasizes that only the continuous portion of the complete 
spectrum is being considered. 
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Table | — Summary of the evaluation of background power for a 
squaring circuit 





Contribution to 


Coefficient Term S(22/T) ( without ) 
coefficient 
1 (11)(aa) Tone term 
T 
9 la)2 saad 
a) 16 
2 
-2 (11aa) aan 


manageable way. Thus it will be pedagogically useful to introduce the 
new method here first, and evaluate (18) again. We first notice from the 
structure of the left side of (18) that only terms of the type (2g2)(Zh2), 
(Ygh)2 and Yg2h? can occur on the right-hand side, i.e., the answer must 
be of the form 


A(2g?)(Zh2) + B(Xgh)? + C(2g2h?) (19) 


where A, B, and C are constants independent of whatever values the g,, 
and h,, take. Setting g, = hp, = dno, the left-hand side of (18) is obviously 
unity. Then also using (19) we obtain the result that 


1=A+B+C. (20) 
Likewise setting g, = hn+1 = 5no yields (since g,h, = 0, all n) 
1=A (21) 


while the choice gp = ho = 1, 8) = hy = 1, gp = hy = 0,k ¥ 0,1 pro- 
vides 

8=4(A + B) + 2C. (22) 
The solution of (20)-(22) is A = 1, B = 2, C = —2 in complete agreement 
with (18). 

It is convenient to introduce the following shorthand: a sum 27 .-. 
will be denoted by a parenthesis (_ ). If the nth term of the sum is gf h? 
the notation 

(11---llaa---a) 
ptimes_ q times 
is used. Thus the terms in (19) are of the types (11)(aa), (1a)?, and (11aa) 
and the right side of (18) is given in the first two columns of Table I. 
Having now obtained the sums and coefficients in (18), the next step 


is to evaluate the sums by the Poisson sum formula (regarding 7 as a fixed 
parameter). Note first that, from (2), the term (11)(aa) in (18) is due to 
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the deterministic part of (2) and hence the background terms are 
only 


2(1a)2 — 2(11laa). (23) 


After the Poisson sum formulas are evaluated we next perform 


= (()d 

— t 

J, 
to eliminate the t-dependence. Finally the Fourier transform with re- 
spect to T' is taken. For example, 


(llaa) = s g2(t —nT)g2(t —7 —nT) 


n=—o 


1 2 . 20 
=— 2rimt/T FR iG ) 
T pron e 
where F(w) is the Fourier transform of g2(t)g2(t — 7). If Go(w) is the 
Fourier transform of g2(t) then 


rae f * 9-iw’tGo(w’)Go(w — w')do’. 
Ir J x 


When we time-average (1laa) only the m = 0 term survives, giving 
aLe 
2xT J- 
The Fourier transform of this with respect to 7 is simply (1/T)Go(w)- 
G»(—w) which is to be evaluated at w = 27/T. 

The actual contribution of the terms in (23) is listed in the third col- 
umn of Table I, the same values being obtained with or without prefil- 
tering. Thus from (9), (13), (23) and Table I we obtain for the squaring 
loop, with or without a prefilter, 


T oo 
Zs f, (1laa)dt = £ F(O) = e'&’TGo(w’)Go(—w’)dw’. 
T Jo T es 


tone power __ a?/8 
background power 28[2(1a)? — 2(11aa)] 
ee Qa 
2BT1—-—a’ 

The fact that (24) becomes infinite for a = 1 is due to the fact that the 
spectrum has a zero then, and higher-order terms in B would be required 
to estimate the background power. 

Applying these results to the 209 data set where a = 0.12, 1/T = 2400 
sec” !, (24) evaluates to about a 12 dB tone/filter ratio if B = 10 Hz, or 
22 dB if B = 1 Hz. 

In order to provide some contrast between the cases which do or do 
not involve prefiltering, the output spectrum in a neighborhood of 27/T 


a<1. (24) 
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Fig. 2—Background power spectra for squaring circuit, with and without prefilter. 


is given in Fig. 2 for small a, i.e. neglecting the (11aa) contribution which 
is proportional to a?. 

The two spectra coincide above 27/T, but the divergence between the 
two on the lower side of the tone frequency is apparent from the 
curve. 


lll. BACKGROUND SPECTRUM FOR FOURTH-POWER CIRCUIT 

The lines in the spectrum of s*(¢) come from the deterministic terms, 
namely from Es4(t). Setting g, = h,, in (18) provides us with the evalu- 
ation 

Es4(t) = 3[Zg2(t — nT)|? — 22g4(t — nT). (25) 
If ~« = 0 only the second term in (25) contributes a tone at 27/T. Since, 
when a = 0, 
2 2rt 


4 
2Dg4(t —nT) = q + 3 cos Tp? 


1/2)? _ 2 
(a) 9° 


As before, the first step in the evaluation of the background spectrum 
is the calculation of 


the power in the tone is 


Es4(t)s4(t — 7). (26) 
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Table Il — Summary of the evaluation of the background power 
for a fourth-power circuit 





Contribution to 


Coefficient Term S(27/T) ( without ) 
coefficient 
96 (11llaa)(aa) 0 
256 (11laaa)(1ia) T/20 
96 (1laaaa)(11) 0 
24 (1a)4 T/6 
9 (11)?(aa)? Tone term 
72 (11)(aa)(1a)? 0 
4 (1111)(aaaa) Tone term 
64 (111la)(1aaa) T/120 
72 (1laa)? T/30 
—6 (1111) (aa)? Tone term 
-6 (aaaa)(11)2 Tone term 
—96 (111a)(1a)(aa) 0 
—96 (aaa1)(1a)(11) 0 
—72 (1laa)(11)(aa) 0 
~—144 (1laa)(1a)? T/12 
—272 (11llaaaa) T/36 


Here the second technique introduced in Section II becomes decisive. 
Just as the results of the evaluation of (18) are summarized in Table I, 
the first two columns of Table IT give the evaluation of (26). 

We must next evaluate the contribution of the terms to S,(27/T). 
Simplifications occur when a = 0, since then 


- WT 
sin —_ 





(11) = (aa) = 1 and (la) = 


TT 


T 


The final result of applying the Poisson sum formula, averaging over t, 
and Fourier-transforming with respect to 7 gives the results in the third 
column, Table II. 

Collecting this we have, for a final filter of bandwidth B at 27/T, 
that 


signal power 5 (27) 


background power 8BT 
This is a significant improvement over the result (24) for the squar- 
ing-loop for small a. In fact, using a = 0.12 again we calculate an im- 


provement factor of 9.16 or close to 10 dB. 
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On Predictive Quantizing Schemes 


By P. NOLL 
(Manuscript received October 26, 1977) 


This paper analyzes the performance of various predictive quan- 
tizing schemes both for noiseless and noisy channels. The fidelity cri- 
terion used to define optimum performance is that of minimum 
mean-squared error. The first part of this paper compares differential 
pulse code modulation (DPCM) with a system that lacks the feedback 
around the quantizer. Such a system (that is called D*PCM in this 
paper) is actually a pulse code modulation (PCM) system with a pre- 
filter and a postfilter. In the second part of this paper a noise-feedback 
coding structure is used as a framework for a unified analysis of pre- 
dictive quantizing schemes with a frequency-weighted mean-squared 
error as the performance criterion. The last part of this paper extends 
the analysis to include the effects of channel transmission errors on 
the overall performance of these predictive quantizing schemes. It is 
shown that DPCM and D*PCM when appropriately optimized are less 
sensitive to channel errors than PCM, and that the performances of 
DPCM and D*PCM are almost identical in the case of high bit-error 
rates. 


Il. INTRODUCTION 


Predictive quantizing schemes employ prediction to exploit the in- 
herent redundancy of input signals; the difference between the actual 
sample of an input signal and its estimate is quantized and transmitted 
to the receiver in a digital format. If the samples of the input are highly 
correlated, then the variance of the samples of the difference signal will 
be significantly less than the variance of the input samples. Hence, the 
overall error between input and output of the communication system 
will be lower than that of a conventional pulse code modulation (PCM) 
system. The first part of this paper compares PCM and two differential 
(predictive) pulse code modulation systems (see Fig. 1). A noise-feedback 
coding structure is then used as a framework of a unified analysis of 
predictive quantizing schemes (including those of Fig. 1) on the basis 
of a frequency-weighted error criterion. The last part of this paper ex- 
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(a) PCM 





Fig. 1—Structures of PCM, DPCM, and D*PCM coders. 


tends the analysis to include the effects of channel transmission errors 
on the overall performance of these predictive quantizing schemes. 

The differential pulse code modulation (DPCM) system has a feedback 
around the quantizer resulting in a prediction that is based on previous 
reconstructed samples (Fig. 1b). The idea is to base the prediction on 
information that is also available at the receiver.! The D*PCM system 
is an open-loop quantization scheme,? i.e., previous samples of the input 
are used for predicting the actual input samples (Fig. 1c). The term 
D*PCM indicates that we have denoted by d*,, the sequence of difference 
samples in the open-loop system. It is important to realize that coder 
and decoder calculate different prediction values. We also see, from Fig. 
Ic, that in this latter scheme the prediction network can be replaced with 
a more general linear network. The resulting scheme is then actually a 
PCM scheme with pre- and postfiltering and we shall use this latter 
scheme to derive bounds for the D*PCM performance. 

Let us briefly discuss the differences between DPCM and D*PCM. We 
denote by gn = dn — rn and gn = d*, — ry the quantization errors of 
DPCM and D*PCM, respectively, and by r, the quantized versions of the 
difference samples. 

It is easy to see that the total coding error 


tn =Xn — Yn (1) 
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between encoder input x, and decoder output y,, is given by 
tn = dn for DPCM (2) 


provided the channel is error-free and the decoder uses the same pre- 
dictor of frequency response P(w). On the other hand, 


tn = dn + (£n — En) for D*PCM, (3) 


i.e., the total error is increased by the difference between the prediction 
values <, and %, of the predictors of encoder and decoder, respectively. 
Alternatively, we may write 


tn = Qn*hn for D*PCM (4) 


provided that the networks of encoder and decoder are reciprocal. In eq. 
(4), « denotes discrete-time convolution, and h,; n = 0,1,2... is the 
impulse response of the linear decoder network. The result can be easily 
derived by recognizing that 


th =Xn — Yn 
= Xn —Tnthn 
= Xn — (d*n —Gn)*hn 
= Qnthn + Xn — Xn*8n*hn 
= qn*hn 


since gn *hp = bn and X;,*6bn = Xn (Zn is the impulse response of the en- 
coder network and 6,, is the Kronecker delta). 

Quantization errors are approximately white noise samples if the 
number of quantizer levels is sufficiently high. Hence in D*PCM the total 
error is nonwhite noise with each quantization error causing an infinite 
output sequence. This error propagation effect is also sometimes called 
error accumulation; this latter term, however, should be used cautiously, 
because it may seem to imply that D*PCM cannot give any improvement 
in signal-to-noise ratio (SNR) over PCM, or, even more strongly, that the 
overall performance can only be degraded. 

One purpose of this paper is to analyze and explain the differences 
between DPCM and D*PCM. The D*PCM coding system has been analyzed 
by Bodycomb and Haddad?; their approach has been based on a pre- 
dictor optimized for a minimum prediction error variance and they have 
shown that this predictor as a prefilter for a Gaussian process produces 
the same total error variance as a PCM system. We shall use the term 
MMSE predictor to describe such a predictor (that has been optimized 
for a minimum prediction error variance). We shall see very shortly that 
this D*PCM scheme can perform better than PCM provided that the 
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predictor is reoptimized; the MMSE predictor is not optimal in this 
context. 

We have already mentioned that D*PCM can be viewed as a PCM 
scheme with pre- and postfilters. If we model the quantizer in the PCM 
scheme as an additive noise source, the optimization of this scheme is 
almost identical to a joint optimization of pre- and postfilters in com- 
munication systems in which channel errors are present. Many contri- 
butions have been made to this problem both for continuous-time and 
pulse-modulated communication systems.4!3 One common result that 
can be extracted from these papers is that half-whitening of the input 
spectrum minimizes the overall mean-squared error (MSE) between 
input and output of a communication system with pre- and postfilters 
in which additive white noise (either caused by a quantization or by 
channel errors or by both effects) is present. Half-whitening is obtained 
if the magnitude-squared frequency response is inversely proportional 
to the square-root of the power density spectrum of the input signal. 

PCM schemes with pre- and postfilters do not take into account that 
in systems with quantizers not only can the input spectrum be shaped 
to improve the overall performance but that, additionally, the quanti- 
zation noise spectrum can be shaped as well to further improve the 
performance of the system. This problem has been discussed in detail 
by Kimme and Kuo!‘ and later by Brainard and Candy" on the basis 
of different coder configurations. These coders have in common a 
feedback of filtered quantization noise to the input of the quantizer and 
we shall use the term noise-feedback coding (NFC) for this approach. 
Our analysis differs from earlier contributions in that a power constraint 
on the quantizer input is not needed. We offer a simplified solution based 
on well-known results in prediction theory. 

It is the aim of this paper to discuss the differences between DPCM and 
D*PCM both for noise-free and noisy channels and to show how they 
relate to noise-feedback coding if the basis of the comparison is a fre- 
quency-weighted minimum mean-squared error. The organization of 
this paper is as follows: in Section II we calculate the differences in 
performance between predictive quantizing schemes with and without 
feedback around the quantizer. We show that D*PCM can perform better 
than PCM for all nonwhite input spectra, but that its performance is al- 
ways below that obtainable with a DPCM scheme. A bound will be derived 
for the differences in performance between these two schemes, and a 
first-order Markov source will be used as an example to explain these 
differences. Section III analyzes noise-feedback coding; its structure is 
given by a prefilter followed by a quantizer with feedback around the 
quantizer.!4 We note that PCM, PCM with noise feedback, DPCM, and 
D*PCM are special cases of this configuration; thus a unified approach 
is possible. We optimize this coder with a frequency-weighted mean- 
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squared error as the performance criterion and show that the prefilter 
has to be a whitening filter for the input signal irrespective of the chosen 
frequency-weighting and that the feedback filter is the MMSE predictor 
of the weighted input spectrum. We also give frequency-weightings for 
which the noise-feedback scheme degenerates to DPCM or to D*PCM. 
Section IV extends the analysis to include the effects of channel trans- 
mission errors on the overall performance of predictive quantizing 
schemes. It is shown that the effects of these errors on total MSE can be 
significantly smaller than those in PCM. In D*PCM the optimum filters 
minimize simultaneously quantization and channel error variances and 
do not depend on the bit-error rate. In the case of DPCM a compromise - 
is needed in order to minimize the total effect of both noise sources on 
the total MSE. At high bit-error rates the performances of D*PCM and 
DPCM are almost identical if their prediction networks are identical. 

The analyses are made under certain restrictions; first of all, we use 
the MSE as a performance measure (in Section III a frequency-weighted 
MSE criterion will be taken into account). Second, the analyses are based 
on the assumption that the quantizer can be modeled as an additive 
white noise source. It is known that this model is accurate for quantizers 
with a sufficiently high number of levels. The model is a poor approxi- 
mation, however, in the case of coarse quantization, especially if the 
quantizer input samples are highly correlated. The results of our analysis 
could be extended to these cases by modifying the model but we do not 
consider such an extension in order not to obscure the main results. We 
also assume that the variance of the quantization noise is much smaller 
than that of the signal to be quantized. Comparison with simulation 
results will show the range where our rather restrictive assumptions 
hold. 


ll. ANALYSIS OF PCM, DPCM, D*PCM 


The principal aim in this section is to calculate and to compare the 
variance of the total errors in PCM, DPCM, and D*PCM. We assume that 
the input is a sample of a zero-mean stationary random sequence {xp} 
with autocorrelation function 


Rx (k) = E|[xatnrk|; (5) 


power density spectrum (pds) 
Siw) = SC Re(kemite, (6) 
k=—0 
and variance 


o2 = R,(0) = > [U Selelde. (7) 
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S,,(w) will be assumed to be a rational function of w. This is not really 
restrictive since most spectra of practical interest can be approximated 
by a rational function. The assumption implies that the pds can be 
represented by S,(w) = 72A(w)A*(w) where 72 > 0 is a scale factor and 
A(w) is the ratio of two polynomials whose zeros are inside the unit circle 
of the z-domain (factorization theorem for rational spectra). All power 
density spectra are nonnegative continuous functions defined for w = 
[—z,z]. Since all signals are represented by stationary random sequences, 
all filters are necessarily of discrete-time type. The action of the quan- 
tizer is represented spectrally as white noise added to the quantizer input 
signal: 


Sq(w) = 07 = R,(0). (8) 


We are interested in the variance 
1 7 
oP = Ele] => f Silw)de (9) 
on —7T 


of the total error tp = Xn — Yn [eq. (1)], where S;(w) is the power density 
spectrum of the zero-mean random sequence {t,,}. A frequency-weighting 
of the total error can easily be added if necessary (see Section III). 


2.1 PCM 


The total error is identical to the quantization error. Hence we 
have 
a2 = o: = ef -o2. (10) 
The quantizer performance factor «7 depends on the properties of the 
quantizer and on the probability density function of the signal being 
quantized; its value is the noise variance generated by a quantization 
of a unit-variance signal. Table IV in Section IV lists various values for 
the cases of 1-bit and 2-bit quantizers. For a given quantizer we thus find 
that 


min {07} = ae - 02. (11) 


PCM 
2.2 DPCM 


The total error is again identical to the quantization error but its 
variance is now proportional to that of the difference signal: 


c= a: = e2 -o4. (12) 


Remark. Some caution is needed if two coding schemes A and B, e.g., 
PCM and DPCM, are being compared. We have to take into account dif- 


1504 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1978 


ferences in the quantizer performance factors e4 and «%, of the two 
schemes, because the probability density functions of the signals at the 
corresponding quantizer inputs may differ. For example, let A and B 
denote PCM and DPCM, respectively. The gain of DPCM over PCM in 
signal-to-noise ratio is then given as 


2 2 2 

Ota om €gA x ; (13) 
2 2 2 

O:B €gB Od 


The ratio ¢24/e2 is very close to unity if the quantizers have a logarithmic 
characteristic because their performance is relatively independent of 
signal statistics. The ratio is also close to unity if the samples of the coder 
input sequences are Gaussian distributed since all coders discussed in 
this paper employ linear networks which do not affect the Gaussian 
distribution. 

In DPCM systems the prediction is affected by the quantization due 
to the error feedback, i.e., the predictor uses previous reconstruction 
values yj = xj — tj = xj —qj;J=n—1,n —2,... instead of the corre- 
sponding input samples x;. The influence of this feedback on the pre- 
diction error variance has been analyzed elsewhere!®—18; it will be briefly 
discussed in the following example but will then be neglected in the 
further analyses. It is known that this simplification is valid if quantizers 
with at least eight levels are employed. If necessary, we shall use the 
terms “real” DPCM and “ideal” DPCM for analyses based on prediction 
with and without error feedback, respectively. 


Example 1: We show the influence of the feedback on the prediction 
error in a DPCM scheme with a first-order predictor of value a. Let p = 
R,,(1)/o2 be the normalized mutual correlation between adjacent sam- 
ples. Without feedback the prediction error variance is 
of = (1+ a2—-2ap)-o2 (14) 

with a minimum 

min{o2} = (1 — p2) + 02 (15) 
for @ = dp: = p. The DPCM difference signal is d, = x, — aXn—1 + agn-1. 
Its variance is given as 

1—2ap +a? 
fee SORE 
oq = 1- ea? Ox (16) 


on the assumption of a vanishing correlation between input and quan- 
tization error,!® and its minimum variance is obtained with!’: 


Gopt = els ey ee ~ 7? G : =) F (17) 
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Fig. 2—DPCM performance: SNR vs. value a of the predictor coefficient. Source: Gaussian 
first-order Markov source with correlation p = 0.9625. Quantizer: B-bit quantizer optimized 
for Gaussian signals. Predictor: Previous sample prediction with coefficient a. 


The value agp; is not critical, however. We use dop: © p to determine the 
total MMSE: 
‘ Lp? 

min{o?} = Gin api" o°. (18) 
Due to feedback there is an increase in variance by a factor (1 — ep?)—}, 
Figure 2 shows the dependence of the signal-to-noise ratio (SNR) on the 
value of the predictor coefficient for various B-bit quantizers and com- 
pares theoretical results obtained from eq. (18) with simulation results. 
It is seen that these are useful approximations in the vicinity of the op- 
timum setting of the predictor coefficient (this is the region where the 
difference signal is almost white noise, and the simulations reveal that 
the assumption of a vanishing correlation between quantization error 
and input signal holds in this case). 


In an “ideal” Nth order DPCM system the prediction of an input 
sample x, is based on previous input samples x,_;;j = 1,2...,N. Hence 
the prediction scheme is essentially that of D*PCM (see Fig. Ic), but it 
is important to realize that these schemes are optimal for different fre- 
quency responses P(w) of the predictor as will be seen later in this paper. 
Equation (12) shows that the optimum predictor in DPCM is the MMSE 
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predictor. The prediction error variance is given as 


of = - f- Sx (w)]1 — Pw) |2de (19) 


and its minimum is reached if the random sequence {d,,} is a white noise 
sequence of variance 72 (full-whitening). Thus we have 


min{og} = 77, (20) 


where 72 is the minimum prediction error variance to be obtained from 
the random sequence {x,,} by passing its samples through a prediction 
error filter with frequency response 1 — Ppp: (w) such that 


Sx (w)|1 _ Popt (w) |? a n?. (21) 


We note that this minimum can be obtained for any stationary random 
sequence if the predictor impulse response is of semi-infinite length, and 
an Nth order predictor can be employed if the random sequence is 
Markovian of order N. For a given pds S,(w) Kolmogoroff?° has given 
the minimum of the prediction error variance as 


n = min{o3} = exp E ie logeSx (o)de | (22) 
wT —7 


This variance 72 is a positive quantity if the process is undetermined, 
i.e., if S,(w) is zero at most at a countable set of frequencies. Otherwise 
the signal is perfectly predictable and the prediction error variance is 
zero then. The normalized prediction error variance 


, exp E f_ log.S. (w)de | 
2 Ny Qn —T 
ee oe ee (23) 


a * (" s.(w)d 
x J_, Se ss 


is called spectral flatness measure”! and its inverse is the optimally ob- 
tainable prediction gain. The spectral flatness measure can be inter- 
preted as the ratio of the geometric mean of the pds S,,(w) to its arith- 
metic mean. It is easy to show?! that 


0<72<1. (24) 
From eq. (12) we finally find that 
min {o7} = ¢2- 72, (25) 
DPCM 


2.3 D*PCM 


The quantization error with pds S,(w) = a2 is filtered by the linear 
decoder network with frequency response H(w) = G~1(w) (see Fig. 1c). 
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Thus the total error variance is 
1 7 
a? = g2.— f |H()|2du, (26) 
2a —T 
i.e., the quantization error variance is increased by the power transfer 
factor 
1 v 
a=— | H(w)|2dw. (27) 
Qa =—1 


The variance of the quantization error depends on the variance of the 
difference signal d*,,: 


1 Tv 
obs Ge oe = ‘{; Sz (w)|G(w) |2deo. (28) 


Thus the total error variance is 
o7= €2 a+ oFs, (29) 


ie., the total error variance of D*PCM is a times that of an equivalent 
DPCM coding scheme (with o% = o%. in the case of fine quantizing and 
identical prediction filters). The impulse response of the postfilter is a 
sequence {1,h1,ho, - - +}; thus we have 


t 
it 


(30) 


and we find that DPCM sila D*PCM for any choice of the pre- 
dictor network.22 This result also implies that the optimum performance 
of DPCM is better than the optimum performance of D*PCM. This fact 
will be shown very shortly in a slightly broader context. 


Example 2: We calculate now the total MSE of a simple D*PCM scheme 
which employs a first-order predictor of value a. The prediction error 
variance has been given in eq. (14). The power transfer factor of the 
decoder network is 


a=1+a%+at+---=(1-—a?)71. (31) 
Thus the total MSE is 
52 5 k= 2op ae. 6 


(32) 


2 
PH=eeacoe= er ae am 


Note that MMSE prediction (a = p) results in the same MSE as that of 
standard PCM,? and that of > © for any p if a > 1. The minimum total 
MSE is obtained for 


ie “(1 sie): (33) 
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Fig. 3—D*PCM performance: SNR vs. value a of the predictor coefficient. Source and 
coder same as in Fig. 2. 


and its value is 
min{o7} = 2 V1 — p2- a3. (34) 


Thus it is possible to reduce the total noise variance by a factor V1 — p? 
instead of (1 — p2) as in ideal DPCM (with negligible feedback). It is in- 
teresting to note that this is the same reduction that we can obtain with 
a block quantization scheme of blocklength 2 (this is ascheme where the 
sum and the difference of adjacent samples are quantized indepen- 
dently).28 Figure 3 demonstrates this fact that the SNR is lower than that 
obtainable with DPCM (Fig. 2). In the case of quantizing with one and 
two bits/sample there is, as expected, a significant difference between 
theory and measurements especially for a-values which are not close to 
the correlation coefficient p. This difference is now explained by the fact 
that, in the case of coarse quantization, the white noise assumption |eq. 
(8)] does not hold for correlated quantizer input samples. Simulations 
have revealed that higher signal-to-noise ratios can be obtained by 
choosing a decoder coefficient that is higher than that of the coder. Note 
also, from Fig. 3, that no gain over PCM is obtainable for the specific case 
a =p. 

We come back now to the total MSE of D*PCM. Equation (29) is given 
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explicitly as 


of = 2 E f_ Sx (w)]1 - Pw)/?de | 


x|5- Slt Pel-*de | (35) 


where we have used our assumptions G(w) = 1 — P(w) and H(w) = 
G~1(w) (both assumptions will be reviewed in the next section). Note 
that the term in the first brackets reduces to 72 if P(w) is the MMSE 
predictor [full-whitening, see eq. (21)]. The difference signal d*,, is white 
noise then, and, since the last term in ot (35) is the power transfer factor 
for white noise inputs, it equals 02/03 = o2/n2. Therefore the total MSE 
is of = «2+ o2, and thus equals that of PCM.? Note that this statement 
is based on assumptions given in the remark in Section 2.2. We relate 
now the minimum total MSE obtainable with D*PCM to that of DPCM. 
By applying Schwarz’s inequality? to eq. (35) we obtain 

min {a7} >? | 5= SL v¥E@ do |= eos (36) 

D*PCM 2a J—n 
where o2; defines the variance of a process with pds VS, (w) . 
Equality is obtained if 


|1 — P(w)|? V'S,.(w) = const. (37) 


We conclude that the squared-magnitude frequency response of the 
D*PCM encoder has to be inversely proportional to the square root of the 
pds of the input (half-whitening). Therefore the optimum frequency 
response 1 — P(w) is a MMSE prediction error filter for a pds VS, (w) . 
The corresponding minimum 72; of the prediction error variance would 
be just the square root of that to be obtained from an optimum prediction 
of a process with pds S,,(w) [this can be verified from eq. (22)]: 


|1 — P(w)|? v Sx (@) = 1z= (38) 


However, it is important to realize at this point that it is not certain that 
the equality in eq. (37) can be obtained at all in practice. Indeed, equality 
can only be expected if VS,.(w) happens to be a rational spectrum, since 
such a spectrum can always be modeled by white noise passed through 
a purely recursive linear filter with poles inside the unit circle. It is clear 
that such a process can be whitened. by a one-step ahead prediction error 
filter 1 — P(w). In general, however, the pds VS, (w) is not rational, and 
the minimum total error variance to be obtained by D*PCM coding is 


7 fix)dx - $7 fE(x)dx = [f7 file )foledde)* with equality if f7(x)/f(x) = const. for any 
square-integrable functions f;(-) and fo(-). 
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greater than the right-hand term of eq. (36). Example 2 has shown that 
an improvement over PCM can be obtained with D*PCM. By applying 
Schwarz’s inequality to the right-hand term of eq. (36) we have 


1 T 2 
eg E f_ VS, (w) do | <eoz= mi {a7} (39) 


with equality only if {x,,} is a white noise sequence, i.e., an improvement 
over PCM can be obtained for all nonwhite processes. Equation (30) has 
already indicated that DPCM outperforms D*PCM for any nonwhite pds 
S,(w). We can make this statement more quantitative by comparing the 
total error variances of D*PCM (in the most favorable case, given if the 
pds V'S,.(w) is rational) and that of DPCM. To be more specific, we use 
eqs. (22)—(25) and (36), and find 

DPCM 


} exp & S. logeS (ode | 
< Qa —T 


ane [Lf smel 


D*PCM 


min {o? 


2 [eel 
x x 
=P =|" | =. <1. (40) 
Ove | OVE 


We conclude that the ratio of the total error variances of DPCM and 
D*PCM is upperbounded by the squared spectral flatness measure 2; 
of the pds V'S,.(w) . The validity of.the right-hand inequality in eq. (40) 
has already been stated in eq. (24). Equality is obtained if and only if {x,} 
is a white noise sequence. In other words, DPCM outperforms D*PCM for 
all nonwhite spectra. 


Example 3: Example 1 has shown that the total MSE of a DPCM scheme 
with a first-order predictor is smaller by a factor 6? = 1 — p? than that 
of PCM (provided that the quantizer performance factors are identical 
in both cases). In D*PCM, the corresponding factor is 6? = V1 — p? (see 
example 2). The value p is in both examples the one-lag normalized 
autocorrelation coefficient of an otherwise arbitrary (though stationary) 
random input sequence. Let us assume now that this sequence is a 
first-order Markovian sequence with autocorrelation 


R,(k) = plFl- of (41) 


and pds 


1 — p? 9 
So) = re (42) 
1+ p2 — 2p cos w 


PREDICTIVE QUANTIZING SCHEMES 1511 


The best DPCM performance is still obtained with a first-order predictor 
since {x,,} is Markovian. The D*PCM reduction factor 6? = V1 — p? is not 
optimal, however. It is interesting to compare it with the upper bound 
to be obtained without the constraint of the realizability of the prefil- 
ter. 

Using eqs. (36) and (42) we find 


min {07} > ef - 82-62 (43) 
D*PCM 
where 
2 
p2 = (1-2) G) F (5.0). (44) 
w 2 


F(z/2, p) is the complete elliptical integral of the first kind: 
a/2 
F G ; p) = §, = 
2 0 V1—p?sin2¢ 


7 1\2 1-3\2 
=7*114+(= Pe) tee 45 
2 (3) re Nea} P >) 
We conclude that a D*PCM coding of a first-order Markov source is up- 
perbounded by a reduction factor 


64 


instead of (1 — p?) for DPCM and V1 — p? for the one-tap D*PCM. For 
p = 0.9625 the one-tap D*PCM coding results in a total MSE that is by a 
factor 3.7 greater than that of (ideal) DPCM, and it is lower-bounded by 
a factor 3.0 (for p = 0.85 the corresponding values are 1.9 and 1.8, re- 
spectively). 


st=(1—p%) [14 702+ atte] (46) 


The foregoing discussion has shown that D*PCM is a suboptimal coding 
scheme if the performance criterion is the unweighted total error vari- 
ance. The next section will demonstrate that D*PCM is an optimum 
coding scheme for a specific frequency-weighted error criterion, and 
Section IV will show that its performance bounds the overall perfor- 
mance of DPCM coders in the presence of channel errors. As a final re- 
mark we mention that D*PCM based on adaptive prediction with a sep- 
arate transmission of the predictor coefficients has been used recently 
for speech coding purposes”‘ to improve the subjective performance. 


Ill. NOISE-FEEDBACK CODING 


In the preceding chapter we have optimized pre- and postfilters for 
a quantizing scheme (D*PCM) and have shown that DPCM has a superior 
performance for all nonwhite processes. At a first glance this seems to 
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Fig. 4—Noise-feedback coding scheme. 


be surprising since D*PCM coding is also based on linear filtering. In 
the D*PCM analysis, however, we did not take into account that not only 
the pre- and postfilters are under our control, but also the quantizer. To 
be more specific, we may additionally filter the quantization noise 
whereas in typical pre- and postfiltering applications it is the channel 
that is perturbed by noise; that noise is not under control of the designer. 
Figure 4 demonstrates how the filtering of the quantization noise is ob- 
tained. The quantization noise, i.e., the difference between input and 
output of the quantizer, is fed back through a linear filter with frequency 
response Q(w) and is added to the input. Q(w) is required to have a 
minimum delay of one sampling time for stability reasons. The purpose 
of the feedback scheme is a reshaping of the spectrum of the quantization 
noise such that the total error variance is minimum. It should also be 
mentioned at this point that there exists also another linearly equivalent 
scheme, the direct-feedback coder, which has been studied by Brainard 
and Candy. It uses a prefilter of frequency response A(w), and the 
output of the quantizer (not the quantization noise!) is first passed 
through a feedback filter of frequency response B(w) and then added 
to the prefiltered signal to form the quantizer input. The equivalence 
to the noise-feedback coder is given by G(w) = A(w)/(1 — B(w)) and 
1- Q(w) = 1/11 — B(w)). 


3.1 Derivation of the basic formula 


Let us, as before, represent the quantizer as a device that adds sig- 
nal-independent white noise of pds S,(w) = «3 to the signal. Due to this 
assumption we can also replace the feedback-quantizer with a nonwhite 
noise source of pds 


Sn (w) a Sq(w){1 — Q(w)|? 
= a3 | 1 — Q(w)|?. (47) 


The feedback acts as linear filter on the open-loop quantizing noise, and 
the effective quantization noise is colored noise then. Additionally we 
may introduce a subjective noise-weighting function S,,(w) whose inverse 
describes the sensitivity of the sink to uncorrelated noise. A small value 


PREDICTIVE QUANTIZING SCHEMES 1513 


of S,,(w) indicates that a high error variance is acceptable in that specific 
frequency region and vice versa. It is well known that such a fre- 
quency-weighting can only serve as a first approximation of noise sen- 
sitivity since it does not take into account nonlinear effects and depen- 
dencies on local properties of the signals. Nevertheless, the frequency- 
weighted noise power is a quite popular design criterion partly due to 
the lack of better distortion measures. What is worth emphasizing is that 
Sw(w) can be interpreted as the squared-magnitude frequency response 
of a noise-weighting filter. Therefore it is reasonable to assume that 
Sy(w) is a rational function of w. S,,(w) can also be explained as the 
output pds of such a weighting filter whose input is a white noise process. 
This interpretation suggests to define a variance 


1 T 
2 — ee 
oh =5- {" Sulw)de, (48) 


a minimum prediction error variance 


1 T 
ris = exp | (" logeSo(wde |, (49) 
Qa —1 
and a spectral flatness measure 
Yo = nv ow (50) 


in accordance with eas. (7), (22), and (23). 
The MSE of a NFC coder is given by 


yee E f S_()|G(o) Pde | 
| 5. J_ So@ HOPI - Qe) "de | (51) 


where we have used eas. (26), (28), and (47). Note that the MSE does not 
include a linear distortion term resulting from a possible mismatch be- 
tween prefilter and postfilter. We are now free to choose linear filters 
G(w), H(w), and Q(w) such that the frequency-weighted error variance 
is minimized. This is obtained by a proper preshaping of the signal 
spectrum and the quantization noise prior to transmission. The general 
design of this NFC scheme has been studied by Kimme and Kuo in the 
context of picture coding.!4 Our objective is to show the connection of 
this noise-feedback scheme with those discussed so far and to show how 
subjective noise-weighting functions influence the design. Our approach 
will also be different from that in Ref. 14 since a constraint is not needed 
in the optimization procedure. The MMSE design problem is to find the 
optimum combination of prefilter G(w), postfilter H(w), and feedback 
filter Q(w) for a given pds S,(w) of the input signal, and a weighting 
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function S,,(w). Let us first assume that G(w) and Q(w) are given and 
let us determine H(w). In an optimized system the total error x, — yn 
must be orthogonal to the output of the postfilter and thus orthogonal 
to the data r, used in that filter (the sequence {r,} is the decoder input 
sequence). It can be shown that this condition also holds in the case of 
weighted total errors. If the postfilter H(w) is not constrained to be 
physically realizable (its characteristics can always be approximated 
arbitrarily closely by allowing for a sufficient time delay), the optimum 
filter is given by the Wiener-Hopf condition 


Srx (w) 
S;, (w) 


where S;z(w) = G*(w)«S, (w) is the cross-spectrum between the sequences 
{r,} and {x,,}. Thus we see that 


opt (w) = (52) 


G*(w)-Sz(w) . 
|G(w)|28,(w) + $,(0)|1 — Q@)/? 


We shall restrict our attention to the case of a small quantization noise 
variance: 


Hopt(w) = (53) 


[G(w)|?Sz(w) 


11 — Qo)? for all w. (54) 


Sq (w) <K 
Equation (53) becomes 


Hope (w) = (55) 


eee 
G(w)’ 
so that G(w) and H(w) are reciprocal filters for any given G(w) which does 
not violate the assumption of eq. (54). It is seen that choosing reciprocal 
coding and decoding filters is not just a convenience but a requirement 
by the MSE criterion. By applying Schwarz’s inequality to eq. (51), we 
find 


min{o?} = &2 E ff VCS — Q(w)|? do | (56) 


Qa 
for any given S,,(w) and Q(w). This minimum is reached if 
Sw 1- 2 
| Gope(w) |? = C2 Veo 30 (57) 


where C is a constant. Equations (51) and (56) can be used to calculate 
the error variances of various coding schemes. 


3.2 Optimization of the noise-feedback coder 


We shall now derive two conditions which have to be met by Q(w) and 
Gopt(w), respectively. We apply again Schwarz’s inequality to eq. (56) 
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and thus have 


min{o7} < é = f- Sx (@)Sy(w)|1 — Q(w)|2dw (58) 


with equality if the integrand is a constant. Note that the term 1 — Q(w) 
describes a prediction error structure. Therefore equality is obtained 
by choosing Q(w) to be the MMSE predictor of a random sequence of pds 
S,,(w)-S,(w) and we have [in the notation of eq. (21)]: 


S x (w)Sw(w)|1 — Qope(w) |? = 02, = 02 = 2 (59) 


where Qpop:(w) is the MMSE predictor that whitens a random sequence 
of pds S,.(w)-S,,(w), and where 7272 is its MMSE. The right-hand equality 
in eq. (59) can be obtained from Kolmogoroff’s result [eq. (22)] by sub- 
stituting S,(w) with its frequency-weighted version S,.(w)S,,(w). When 
comparing eqs. (56) and (59) we find 


min {o7} = €F + 02+ Nu (60) 

NFC 
We conclude that the frequency-weighted total error variance of an NFC 
scheme with given quantizer is determined by the product of the pre- 
diction error variances of the spectra S,(w) and S,,(w), and that Qop: (w) 
is the optimal predictor of a pds S;(w)-S,(w). We also have, from eqs. 
(57) and (59), that S;(w)|Gope(w)|? = const. This implies, however, that 
| Gopt(w)|? is a filter which whitens S,(w) which has been assumed to be 
rational. Thus a sufficient condition for an optimum NFC scheme is to 
choose as a prefilter an MMSE optimized prediction error filter: 


Gopt (w) Sle Popt (w) (61) 
where Pop: (w) is defined by 
Sx(w)|1 — Popt(w)|? = 2. (62) 


Note the important fact that the overall performance is optimized if the 
prefilter is an MMSE prediction-error filter that whitens the input process 
of pds S,.(w) and that this result holds for any choice of the weighting 
function. It turns out that we have to modify the quantization noise 
feedback loop but not the prefilter if a weighting of the noise has to be 
taken into account. We finally find from eqs. (20), (47), (59), and (62) 
that the weighted error spectrum S;(w) = ¢Z09S,,(w)|1 — Qop:(@)|? 
-|1 — Pope (w)|-? is constant and thus equals the total error variance [see 
eq. (60)]. We finally note that these optimization results can also be 
extracted from the Kimme and Kuo paper.!4 Musmann has recently 
derived equivalent results based on a more information-theoretical 
analysis.?° 
We shall now briefly discuss two special cases. 
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Fig. 5—Model of NFC coder with prediction error filter as input. 


Special case: S,,(w) = 1 for all w (DPCM). A comparison of eqs. (59) and 
(62) reveals that Qop:(w) = Pop: (w), i-e., the predictor prefilter and the 
feedback filter have to have identical frequency responses. It is easy to 
see?® that these filters can then be combined to yield the DPCM structure 
of Fig. 1b. A DPCM structure is indeed defined by having Q(w) = P(w) 
and Fig. 5, which is a model of a noise-feedback coder, reveals that no 
frequency-weighting of the quantization noise is possible in this case for 
any choice of the predictor (provided that pre- and postfilters are re- 
ciprocals), since the quantization noise passes both the feedback filter 
with frequency response 1 — P(w) and its inverse, the postfilter. 


Special case: S,,(w) « Sz1(w) for all w (D*PCM). A weighting function 
that is in some sense inverse to the pds of the signal is of importance for 
quantizing acoustic signals since it may avoid a masking of weak signal 
energies in specific frequency ranges by the quantization noise. 

For S,,(w) « Sy'(w) we find from eq. (59) that |1 — Qopz(w) |? has to 
be a constant. This clearly means that Qop:(w) = 0 for all w; i.e., the best 
coding scheme is now D*PCM. The prefilter is, of course, as for all optimal 
NFC coders, a whitening filter. The D*PCM scheme is suboptimum if it 
is used in connection with other weighting functions. The special case 
of S,,(w) = 1 has been discussed in detail in Section II; we have seen there 
that for S,,(w) = 1 and a MMSE predictor P(w) no reduction in total error 
variance over PCM is obtainable. The above discussion reveals that the 
same coder is, however, optimal for the specific noise-weighting function 
S,,(w) « Sz!(w). Indeed, subjective gains of about 6-10 dB have been 
reported for speech signals, for this choice of the prefilter.2®27 


Table I lists various NFC configurations. The performance of some 
suboptimal coders will be compared with the optimal NFC scheme in the 
next part of this section. An elementary example will suggest the manner 
in which the NFC design influences the total error variance. 


Example 4: Assume a noise-feedback coder with just one tap, i.e., Q(w) 
= q-exp(—Jw), and a prefilter with the structure of a one-tap predic- 
tion-error filter: G(w) = 1 — a-exp(—/jw) (see Fig. 6). Further assume a 
sequence with an adjacent-sample correlation p. We have H(w) = G~1!(w) 
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Table | — Performance comparison of NFC coder configurations 










DPCM 
Noise weighting Q(w) = P(w) 
Sy(w) = 1 = DPCM Optimal coder: Suboptimal coder (upper 
(no weighting) J1 —P|2«S;! bound half-whitening): 
‘ ji Pla So 
Sw(w) « S,—-'(w) = D*PCM Suboptimal Optimal coder 
coder (full-whitening): 
ie 2=|1—P|2« Sz} 
All other Optimal Suboptimal ‘Suboptimal coder 
cases: coder coder 


and the unweighted total MSE [S,,(w) = 1] can be derived from eq. (51); 
we shall omit the intermediate steps. The final result is 


p. pita? —2ap 


Of = €% (1 + qg2— 2aq) - oc? 


1-a? 
a <1;q arbitrary. (63) 


For a = 0 and q = 0 we have the PCM result of eq. (10). For a = 0 and fi- 
nite q we obtain of = (1 + q?)-0%, ie., noise feedback without prefil- 
tering increases the MSE by a factor 1 + q? over that of PCM. For g = 0 
and finite a < 1 we have the case of D*PCM, i.e., prefiltering followed by 
quantization [eq. (32)]. The best choice of q is q = a if a is given; the 
scheme reduces then to DPCM [eq. (1)] and reaches its MMSE for a = q 
= p. The different cases-have been tabulated in Table II. 


3.3 Suboptimal coding schemes 


The last section has shown that noise-feedback coding is a scheme that 
allows for the optimal shaping of the spectra of the input signal and the 
quantization noise such that the noise-weighted overall error variance 
is minimized. The prefilter is in all cases a MMSE prediction error filter 
1 — Pope (w) which performs a decorrelation of the input signal. The op- 
timal scheme reduces to DPCM (Qope(w) = Pope(w)) in the case of un- 
weighted noise (S,,(w) = 1), and to D*PCM (Qop:(w) = 0) in the case of 
Sw(w) « Sz1(w). In all other cases, only the general NFC scheme (with 
Q(w) # P(w)) is optimal. 





Fig. 6—One-tap NFC coder of Example 4. 
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Table II—Error variances of coding schemes with one predictor 
coefficient aop1 and one feedback coefficient Gopt 


min{o?} opt Qopt 
PCM é =o 0 0 
Noise-feedback PCM e(1+ q2)o2 0 0 
D*PCM &V1—p* 0 0 FuSNvaa 
p 

DPCM (ideal) é (1 — p2)o2 p p 

: : 1— p? 
DPCM (real, i.e., with q7—a o Gopt <p [see eq. (17)] 
coarse quantization) €qP 


This section compares the performances of various suboptimal coding 
schemes which belong to the class of NFC schemes and which are derived 
by choosing suboptimum prefilters and feedback filters. The corre- 
sponding error variances can be derived directly from eq. (51). We shall 
omit the intermediate steps in the calculations and optimizations of «7 
and shall present the final results only. 

PCM. PcM results if G(w) = H(w) = 1 and Q(w) = 0. We then have 


min {o7} = «20203. : (64) 


PCM 
The quantity o7, as defined in eq. (48) represents the subjective gain if 
white quantization noise is weighted. 

Noise-Feedback PCM. We have G(w) = H(w) = 1, and Qopt(w) is the 

MMSE predictor of the weighting function S,,(w). Hence we have 

min = {a7} = eZo2ni3. (65) 

noise-feedback 
PCM 


This result shows that 72, as defined in eq. (49) represents the error re- 
duction obtainable if the quantization noise is optimally shaped. Note 
that optimal noise-shaping reduces the weighted error variance in 
relation to PCM by a factor y2, = 72/02, which is the spectral flatness 
measure of the weighting function S,, (w). 


Remark: We have to mention at this point that eq. (65) only holds if y2, 
is not close to zero. This implies that the predictibility of S,,(w) (which 
is bounded 'by the dynamic range of the spectrum?) should not be too 
high. Otherwise one could obviously achieve large reductions in error 
(including unbounded ones for weighting functions which are zero over 
a finite segment of the frequency axis; see eq. (22) and the following 
discussion thereof). The noise reductions are obtained by shifting the 
noise in frequency to a range where it is less heavily weighted by S,, (w). 
However, the noise-shaping increases the total variance of the quantizer 
input signal since the correlated quantization noise is added to the input 
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signal. Therefore the quantizer step sizes have to be readjusted ac- 
cordingly. As a consequence we obtain an increase in the quantizer 
performance factor é and subsequently an increase in total error vari- 
ance. Spang and Schultheiss?9 have discussed in detail the stability 
problems involved. They have been interested in noise reductions in 
oversampled systems where the noise can be shifted into the high-fre- 
quency range and subsequently eliminated by lowpass filtering. In that 
application we have a weighting function that is zero over a finite seg- 
ment of the frequency axis. As a means for reducing the stability prob- 
lems Spang and Schultheiss propose to keep the number of feedback 
elements R finite. In this case the total error variance is given as 


1 w 
a= 5 f Sule) 


The feedback network can be viewed as an Rth order predictor. Its op- 
timal coefficients q,; k = 1,2,..,R can be derived from the set of R 
normal equations whose coefficients are just the Fourier coefficients of 


Sy (w). 89 
D*PCM. We have G(w) = 1 — P(w) and Q(w) = 0. 


(i) Lower bound (half-whitening). The lower bound is reached if 
[1 — Pope (w)|? = VS, (w)/S,(@) . We obtain 





R 12 
1— > qpeko| dw. (66) 
k=1 


min {07} = efotau (67) 
D*PCM 
where 
: 1 
pee f «/ Says Sila) de. (68) 
Qn —T 


(it) Full-whitening. Let P(w) be the MMSE predictor for S,(w). It 
follows that 


Of = €50xw (69) 
where 
y= 5- J Sx(e)Sulwide. (70) 
Qn =< 
DPCM. Let P(w) = Q(w) be the MMSE predictor for S,.(w). It follows 
that 
min {o7} = eqnzon (71) 
DPCM 


Note that optimal prediction has reduced the weighted error variance 
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in relation to PCM by a factor y2 = n2/o2, i.e., by the spectral flatness 
measure of S,(w). 

Prefiltered DPCM. Let Q(w) be the MMSE predictor for S,.(w). DPCM 
results if P(w) = Q(w). Prefiltered DPCM results if P(w) + Q(w). By 
varying P(w) we obtain 


min {o7}= ejniova (72) 
Prefiltered 
DPCM 
where 
1 T 
ome 5 f. VS,(a) du. (73) 


From Schwarz’s inequality we have o173 < o7,. Hence we find the result 
that the performance of DPCM can be improved by employing an addi- 
tional prefilter. It is worth emphasizing, however, that we have set Q(w) 
to be the MMSE predictor for S,(w). The optimal NFC scheme results if 
we are also free to optimize this feedback filter (see below). 

NFC. The minimum total error variance of NFC schemes has already 
been given in Section 3.2 and is repeated here for completeness: 


min {o7} = egnins- (60) 


NFC 
Note that optimal prediction and optimal noise shaping properties are 
provided by NFC schemes. Accordingly the total error variance is reduced 
in relation to PCM by a factor 2 - y2, which is the product of the spectral 
flatness measures of input spectrum and weighting function. We shall 
see very shortly that this scheme is very close to theoretical bounds in 
the case of Gaussian input sequences. 


Example 5: We shall now evaluate the above derived results for the 
specific example of a first-order Markov source of variance o2 and nor- 
malized adjacent-sample correlation p = 0 whose power density spectrum 
has already been given in Example 3. Such a source is a useful first ap- 
proximation for modelling the statistics of speech (with p = 0.85 in this 
example) and of television signals (with p = 0.9625 in this example). 

Speech signals. We assume a weighting function which is inversely 
proportional to the pds of the signal. We have already mentioned in 
Section 3.2 that subjective gains of about 6-10 dB have been reported 
for this specific weighting. The foregoing analysis has also revealed that 
the optimal scheme, i.e. NFC is identical to D*PCM. We have 


Sw(w) = c?- Sy'(w) (74) 
where c? can easily be determined if S,,(w) is normalized such that 
max {S,,(w)} = 1 (75) 
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Table Ill — Comparison of coder design parameters 





Sw (w) =1 Sw(w) « Sri (w) 
ne (1 — p?)+o2 (1 — p2)-02 
1 
2 1 Js. 
uy (1+ py? 
9 1 + p? 

1 eee ee 
me (1+ py? 
oz, o2 ne Np 
ova 2k ( p) it = g 

T 2° lt+p ™ 


and if, in addition, use is made of the following result: 


1 — 

P ot. (76) 
lt+op 
Table III lists the important quantities for S,,(w) = 1 and S,,(w) « 


Sz!(w). F(,-) is the normal elliptic integral of the first kind [see eq. (45)], 
and E(-,-) is of the second kind. We have 





min {S,.(w)} = 


1 3 
E (5) lee cae 
2 2 4 64 ee 
The quantity k is given by 
k = V4p/(1 + p) (78) 


and E(z/2, k) is close to unity for p = 0.85. 

From Table III we find that the total error variance can be reduced 
by a factor y2 = 72/02 = 1 — p? by means of optimal prediction and ad- 
ditionally by a factor y2 = n2/o2 = (1 + p)~2 by means of optimal noise 
shaping. For p = 0.85 the obtainable improvements in weighted signal- 
to-noise ratio are 5.6 dB and 2.4 dB, respectively. It is interesting to note 
that a noise-shaping improvement of about 1 dB has been calculated in 
Refs. 25 and 31 on the basis of experimentally determined speech spectra 
and noise-weighting functions. The total improvement is 8.0 dB for 
D*PCM or NFC, and it is 6.5 dB for prefiltered DPCM. 

The achievable error variances of various coding schemes can be de- 
termined by using eqs. (60) and (64)—(73): 





La pe 
ge (79a) 


PCM: min {07} = e? (+p? o% 
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it 
Noise-feedback PCM: min {o7}= &? G+)? o (79b) 
p 





pl = 
D*PCM and NFC: min {o7} = &2 P o (79c) 
(1 — p)(1 + p?) 
DPCM: j= 2 —=e 79d 
min {07} = 2 a+ 9) a (79d) 
Prefiltered DPCM: min {a7} © €2(1 — p?) (= ) of. (79e) 


Television signals. An average video spectrum is flat for frequencies 
below the line rate and falls at about 6 dB per octave through the rest 
of the band. A weighting function for such a signal has a negative slope 
of about 3 dB per octave.7:3132 We assume a weighting 


S wo) = c2 VS, (w) (80) 


and it is not difficult to show that the reduction factor for noise shaping 
is 
T 1 
= = (81) 
ar (F ) eo pe ee 
ep 42 Peak 

for this choice of the weighting function. Using p = 0.9625, we find a 
performance improvement of 11.3 dB obtainable by optimal prediction 
and a noise-shaping gain of 2.4 dB. This latter figure is in good agreement 
with the results of calculations based on experimental data.?°:3! 


3.4 Absolute performance bounds 


In the foregoing sections we have optimized various coding schemes 
whose structures had been given beforehand. It is useful to compare the 
performance of this class of encoders-decoders with absolute perfor- 
mance bounds by consulting the distortion-rate function.!2 A detailed 
discussion of these bounds for speech and television signals has been 
given by O’Neal.*! In the following we shall restrict our attention to 
source coding of stationary ergodic Gaussian processes. For a fixed rate 
R there exists a minimum possible average distortion D which is a lower 
bound for any coding scheme. As above we adopt a frequency-weighted 
mean-squared error as our distortion measure. D and R are then related 
parametrically as follows:33:44 


Die) => — (" minfe, Sule) + Se(w)ide (32) 


R(g) =— ae max (0, loge ee (83) 
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Next we define the weighted error spectrum by 


1 T 
D=— f Side (84) 
Qn —7T 
and, by comparing it with eq. (82), we find 
g ¢ < Sy(w) - S,(w) 
S = 85 
i(o) le: (w)S, (w) otherwise. ae) 


Over the frequency range where ¢ > S,(w)-Sx;(w) we have S;(w) = 
Sw (w)+Sx(w). This implies that no signal has to be transmitted over this 
frequency range since such a measure produces just this error spectrum. 
Over the frequency range where ¢< S,,(w)>S,(w) we have S;(w) = g, ie., 
the weighted error spectrum must be constant. Section 3.2 has shown 
that this requirement is met by the NFC scheme. In this latter case of 
small distortions we have D(y) = y= D and, by combining eas. (22), (49), 
and (83), we obtain . 


D = 2-?R . nie ni. (86) 


A comparison with eq. (60) indicates that optimal NFC coding is very 
close to the distortion-rate bound D. The difference is in the first 
right-hand term of these equations because «7 > 2~2" for single-letter 
quantizers (see Table IV). We finally note that the three right-hand 
terms in eq. (86) correspond to the terms Tz, Tp, and Ts in O’Neal’s 
paper.?! 


iV. TRANSMISSION ERRORS 


Noise-feedback coding schemes (including D*PCM and DPCM) are 
affected differently from PCM systems by bit errors on the communi- 
cation channel because the decoder loop causes an error propagation 
while a PCM error does not propagate in time. The objective of this sec- 
tion is to show the effects of transmission errors in predictive coding 
systems using some of the results of our above analysis. We shall con- 
centrate on two coding schemes, D*PCM and DPCM, and we shall only 
use the unweighted mean-squared error criterion. Recall that DPCM and 
NFC are identical in this case. 


4.1 PCM 


Let us assume that quantizing noise g, and channel noise c,, can be 
modelled as additive noise sources. Thus the total error is 


tn= dnt Cn (87) 


and its variance is 


‘= a7 + 02+ 2E[qn- en]. (88) 
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Table IV — Quantization error variances é (Max-quantizers) and 
channel coefficients 


2 


Bae Fee et eet 

1 bit 2 bit 1 bit 2 bit 
Uniform pdf 0.25 0.0063 3.0 3.75 
Gaussian pdf 0.363 0.118 2.55 4.65 
Laplacian pdf 0.5 0.176 2.0 5.3 
Gamma pdf 0.667 0.232 1.33 6.28 


Totty and Clark have shown** that channel errors and quantization 
errors are uncorrelated if the quantizer structure is that of Max.°6 These 
quantizers minimize the variance of the quantization noise but not 
necessarily that of the total error. This approach is of interest if a coding 
scheme has to operate on noisy channels with small bit-error probabil- 
ities which additionally are unknown or changing." It is also justified by 
our observation that the step-size of quantizers with a low number of 
levels is not critical. The channel error variance depends on the bit-error 
probability P, on the density function of the signal being quantized, and 
on o2 (because the input variance determines the quantizer step-size 
scaling). Thus we have 


min {a2} = €2- 6? (89) 
PCM 


and the normalized channel error variance can be written as 
e=y-P (90) 


provided that the codewords are only affected by single bit errors. The 
channel coefficients y can be derived following an approach in Ref. 39. 
Table IV lists these values for 1-bit and 2-bit quantizers.!9 In the case 
of 1 bit, the channel coefficient is simply given as 


y=4-(1—). (91) 


In Table IV the y-values for 2-bit quantizers are given for the folded 
binary code with the exception of the uniform probability density 
function whose y-value is lower in the case of a natural binary code and 
is given by:4° 


62 = 4-P(1 — €?) (92) 
where 
ce, = 2-28 (93) 
with B as the number of bits. 


+ A re-optimization of quantizers for noisy channels has been discussed in Refs. 37 and 
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The total error variance can thus be calculated as 


min {o7} = (e+ €2)-of = (B+ y-P)-o;% (94) 
PCM 
on the assumption of a vanishing correlation between the two errors. 


4.2 Noise-feedback coding 


The analysis of the predictive quantizing systems in the presence of 
channel errors is essentially identical to that of the last section if we 
substitute the nonwhite noise source S,,(w) in eq. (47) with 


Sn(w) = 62|1 — Q(w)|? + o2 
[eg1 — Q(w)|? + 2] - of (95) 


and if the assumption of eq. (54) still holds. Thus it is possible to reop- 
timize the various coders by following the same procedure as in the last 
section. We shall concentrate in this section on two coding schemes, 
D*PCM and DPCM. The objective here is to show that the D*PCM per- 
formance provides a bound for all predictive quantizing schemes if the 
channel is noisy. 

For both schemes the contribution of the channel errors o? on the total 
error variance can directly be derived from the D*PCM results of Section 
2.3 by replacing <2 with «2. We then have, from eq. (29), 


D*PCM: a2 = a “ae o4s 
DPCM: o2 = e2-a- a3 (96) 


where a is the power transfer factor of eq. (27). Therefore the variance 
of white noise on the channel is increased in the decoder network by a 
factor a = 1. This error accumulation does not imply that the effect of 
transmission errors in D*PCM and DPCM is more severe than in PCM, 
because the generated noise variance depends now on the variance of 
the difference signal and can thus be influenced by the prefilter. Gains 
over PCM even for noisy channels have indeed been reported recently.4!-42 
The discussion of Section 2.3 has already shown that the total MSE can 
be smaller than that of PCM; it has also shown that coder and decoder 
should have inverse networks. 


4.2.1 D*PCM 


Transmission errors contribute to the total error in exactly the same 
way as quantization noise. The total error 


tn = (dn + ¢n) * hn (97) 


has a variance 


of = (e + 62) - a+ os. (98) 
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An optimized D*PCM scheme minimizes at the same time both error 
contributions. The mean-squared errors are those given in Section 2.3 
if & is replaced with «2 + &. Note that the optimal filters in D*PCM 
schemes do not depend on the bit-error rate. We have seen that half- 
whitening of the input spectrum provides a D*PCM performance bound. 
From eq. (36) we conclude that the minimum channel error variance is 
given as 


min {o27}= @otp. (99) 
D*PCM 


4.2.2 DPCM 


Section 3.2 has revealed that D*PCM has the same total quantization 
error variance as PCM if the input is full-whitened (MMSE prediction). 
The D*PCM postfilter is then identical with that of DPCM if this latter 
scheme has been optimized for the noiseless channel. These observations 
imply that transmission errors cause the same channel error variances 
in DPCM as in PCM if an MMSE predictor is employed. Smaller MSE 
values, i.e., improvements over PCM, can be gained by reducing the 
whitening effect. The quantization noise MSE, however, increases then. 
The total error 


b= Onck Ca ie (100) 


has a variance 
o7 = (e + ae?)o%. (101) 


In the case of high bit-error probabilities (ae? >> ¢?) the total MSE is 
minimized if the prediction network performance is close to that of an 
optimized D*PCM coder and eq. (99) provides a lower bound in channel 
error variance for DPCM (and NFC) in the case of noisy channels. For low 
bit-error rates the best predictor will have a characteristic between 
full-whitening (error-free transmission) and half-whitening (D*PCM 
bound for noisy channels). Therefore 


min {o2}> min {o2}> min {02} (102) 
PCM DPCM D*PCM 


Example 6: Assume a previous-sample 2-bit DPCM and a Gaussian (not 
necessarily Markovian) source with adjacent-sample correlation p = 

0.85. Let the bit-error probability be P = 0.05. From eq. (90) and Table 
IV we have e, = 0.118 and «; = 0.233. Figure 7 shows the dependence of 
the signal-to-noise ratio of this DPCM scheme on the value a of its pre- 
dictor coefficient both for the noiseless and the noisy channel. The 
theoretical results have been calculated from eq. (101) using eqs. (16) 
and (31). 
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THEORETICAL RESULTS 
@@@ OOO SIMULATION RESULTS 


SNR (dB) 





0.4 0.5 0.6 0.7 0.8 a=p 0.9 1.0 


Fig. 7—DPCM performance on noisy channels. Two-bit quantization of Gaussian source 
with adjacent-sample correlation p = 0.85. Folded binary code with bit-error rates P = 
0 and P = 0.05. 


It is seen that DPCM performs better than PCM if the predictor is ap- 
propriately chosen. The differences between theory and measurements 
for low values of the predictor coefficient are again a consequence of the 
fact that the quantization error has been assumed to be not correlated 
with the input signal. For quantizers with a low number of levels this 
assumption only holds if the signal being quantized is uncorrelated, i.e. 
for predictor coefficients close to p. Notice that the crosspoint between 
PCM and DPCM performance is reached for a value of a which is slightly 
higher than p; this deviation from the predicted performance is a con- 
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PREDICTOR 





Fig. 8—Noiseless channel feedback DPCM. 


sequence of the noise feedback. Figure 7 also compares the DPCM per- 
formance in the case of a noisy channel with the equivalent D*PCM 
performance which has been obtained from eqs. (98), (14), and (31). It 
is apparent that this one-tap D*PCM performance is a very useful bound 
of the DPCM performance. The optimal value of the DPCM predictor 
coefficient is also close to the optimal D*PCM coefficient app: as given 
in eq. (33) (Gope © 0.57 for high bit-error probabilities). The choice of agp: 
in accordance with eq. (38) for very noisy channels has first been men- 
tioned in Ref. 41. 


4.2.3 Noiseless channel feedback DPCM 


In D*PCM the predictors of coder and decoder operate on slightly 
different signals, because there are quantizing noise and channel noise, 
respectively, in the intervening path. In DPCM both predictors operate 
on the same signal, viz. the sequence of reconstructed samples, if only 
the channel is noiseless. In the case of channel errors the predictions are 
again different, and the channel noise has the same effect on the overall 
MSE as in D*PCM. Let us assume that a noiseless channel from decoder 
to coder is available (see Fig. 8). It is then possible to ensure identical 
predictions via the feedback channel by retransmitting the prediction 
values calculated at the decoder. Notice that this scheme is identical to 
standard DPCM if the transmission of the encoded difference samples 
to the decoder is noiseless. In the case of channel errors, however, we have 
a total error 


n=, FO, (103) 

of variance 
c= (é + €?) - 6%. (104) 
The contrast to D*PCM and standard DPCM is quite clear; the feedback 
is now around quantizer plus noisy channel, and thus error accumulation 


in the decoder loop has been totally avoided. 
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Example 7: In the case of previous-sample prediction the results of 
Example 1 (DPCM with analysis of the effects of noise feedback) apply. 
The optimum choice of the predictor-coefficient is given by eq. (17) if 
e is replaced with «2 + ¢2. To demonstrate the improvements let us 
use the figures of the previous example. We have eG + ef = 0.851, and 
hence dop: = 0.755. We finally find [from eqs. (16) and (104)] a signal- 
to-noise ratio of 9.0 dB instead of 7.7 dB in the case of DPCM without 
noiseless channel feedback. 


V. SUMMARY 


There is a great interest in low bit-rate transmission of speech and 
television signals. Especially for acoustic signals it is well known that 
the subjective performance of a coder is strongly affected by the way in 
which quantizing distortion is distributed in frequency. In this paper 
we have compared coding schemes which employ prediction to exploit 
the inherent redundancy of these signals and which employ noise- 
shaping for optimizing the subjective performance on the basis of a 
frequency-weighted error criterion. First we have shown that DPCM 
outperforms D*PCM (a predictive scheme that lacks the feedback around 
the quantizer) for all nonwhite input spectra if the performance criterion 
is the unweighted total error variance. We have then used a noise-feed- 
back coding structure as a framework for a unified analysis of predictive 
quantizing schemes. With this structure a minimum frequency-weighted 
error variance can be obtained by a proper shaping of the signal spectrum 
and the quantization noise prior to transmission. A comparison of this 
error variance with the distortion bound as given by the distortion-rate 
function for Gaussian signals has revealed that the performance of the 
noise-feedback coder is almost optimal for this class of signals. We have 
also shown that this coding structure degenerates to DPCM in the case 
of unweighted noise, and to D*PCM if the weighting function is inverse 
to the input spectrum. The performance results for these optimal coders 
have then been compared with those of suboptimal schemes including 
noise-feedback PCM and DPCM with prefiltering. For first-order Markov 
sources which often serve as a model of actual input spectra we have been 
able to derive simple explicit results in terms of the autocorrelation 
coefficient. In the last part we have examined the effects of channel 
transmission errors on the overall performance of these predictive 
quantizing schemes. We have shown that these coders when appro- 
priately designed are less sensitive to channel errors than PCM. In D*PCM 
channel errors contribute to the total error in exactly the same way as 
quantization noise. Thus the D*PCM results provide a guideline for the 
optimization and a bound for the performance of DPCM. 
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An Automatic Bias Control (ABC) Circuit 
for Injection Lasers 


By A. ALBANESE 
(Manuscript received March 18, 1977) 


A totally electronic method of stabilizing the output light of an in- 
jection laser is presented. This novel method compensates the drift of 
the threshold current in gallium aluminum arsenide lasers by means 
of a feedback signal derived from the laser voltage. 


I. INTRODUCTION 


The threshold current of an injection laser varies from device to device 
and is a function of the device age and temperature. This threshold 
variation causes the laser output to change when the drive current is held 
constant. One must therefore provide a bias control circuit to compensate 
for the threshold variations. 

Feedback circuits using photodetection have been successfully used 
for this purpose.! The output is monitored with an optical detector and 
compared with the input signal to generate an error signal that is fed back 
into the laser current. 

This paper introduces a new concept of compensating the change in 
the laser threshold by using the electrical characteristics of junction 
lasers. An electrical circuit monitors the ac voltage and the ac current 
of the laser and generates the bias current needed to operate the laser 
above the threshold level independently of the laser temperature and 
age. 

One of the benefits of using an electronic feedback scheme to stabilize 
the laser output is the elimination of an optical detector; this reduces 
the number of optical components required and may lead to a more 
economical and simpler solution than using a photodetector. 

The electronic bias control method is based on the fact that the laser 
junction voltage saturates at currents above threshold.2-° Figure 1 shows 
L and V; as functions of I. L is the laser output light at one of the faces, 
V; is the laser junction voltage, and J is the laser current. Figure 1 also 
shows the changes in L and V; produced by varying ! in the vicinity of 
the laser threshold, J;. The laser current is the sum of a bias current J; 
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Fig. 1—Laser junction voltage and laser output light as a function of the laser cur- 
rent. 


and a modulated current J,,. The part of I, below J; (called J,) is an error 
current that does not modulate L but modulates V; by a magnitude Ve. 
Conversely, the part of I, above J; modulates L but not Vj. 

Laser stabilization consists of monitoring the junction voltage and 
increasing the laser bias current until the junction voltage is saturated. 
This is achieved automatically and continuously by an electronic circuit 
called an Automatic Bias Control (ABC) circuit. The ABC circuit monitors 
the laser voltage, generating an error signal proportional to the degree 
that the laser junction voltage is not saturated, and increases the bias 
current in the laser until the error voltage is minimized. The ABC circuit 
consists of an operational amplifier that amplifies the error voltage, a 
peak envelope detector that rectifies the amplifier output, and a current 
source that produces the current to bias the laser. 

The circuit shown in Fig. 2 was built to study some of the character- 
istics of the electronic feedback method proposed here. Without stabi- 
lization provided by this circuit, the light output L of an injection laser 
varies strongly with temperature, as shown in Fig. 3 by the curve labeled 
“without feedback.” Here, a constant bias of 124 mA was supplied, and 
the laser was modulated with a 70-kHz, 5-mA sinusoidal signal. One sees 
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Fig. 2—Automatic bias control circuit. 


immediately from the other curve, labeled “with feedback,” how use of 
the ABC circuit improves the output stability. The variation AL/AT is 
reduced from 0.37 mW/°C to 0.023 mW/°C, a factor of 16. 

The following sections cover the details of the functioning and the 
limitations of the ABC circuit. 
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Fig. 3—Laser output as a function of the laser temperature. 
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Fig. 4—System representation of the ABC circuit. 


ll. SYSTEM REPRESENTATION 


In this section a mathematical model for the laser together with the 
ABC circuit is developed. This model is used to explain the function of 
the different parts needed to stabilize the laser output. 

To analyze the operation of the ABC circuit, let us assume that the laser 
is biased with a direct current J, which is smaller than the threshold 
current level J;. Then, according to Fig. 1, V. can be expressed as a 
first-order approximation by 


RI, for OSI, S Im 


= 1 
Ve tae for Ln < Ie oe 
where 
R= oF , evaluated at J ~ I;, (2) 
and 
I, = 1, — Ip. (3) 


The purpose of the ABC circuit is to minimize J,, by monitoring V, to 
control J,. Figure 4 shows a diagram of the ABC circuit and the laser. The 
laser is represented by the box indicated by dashed lines. It has one 
output, V,, and three inputs, J,,, [, and Jy. In principle the ABC circuit 
is a transadmittance amplifier, with a gain Y, that amplifies and converts 
V.. into Ip: 


Iy = Inn + (Y X Ve). (4) 
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Ipo is a fixed prebias current, which is smaller than the minimum J,; Io 
is optionally provided to reduce the current in the output transistor and 
gain required of the transadmittance amplifier. 

Equations (1) through (4) determine the operating points of the laser 
to be: 


I, — Ino I, — Igo 
i, =],-—_— f <—— < 
a ear oe Ve ag ot (5) 
and 
_ teal 
°  1+A (6) 


A is the closed-loop gain given by the product R X Y. Equations (5) and 
(6) indicate that if 1 + A > U; — Ig0)/Im then 


I, ~ I, (5a) 
and 
Im > Ie * 9, (6a) 


that is, the laser is biased at the laser threshold, the error current be- 
comes small, and the modulating current is above the laser threshold. 


lil. INSTABILITY OF THE LASER OUTPUT 


It will be assumed that the instability of the laser output is mainly 
caused by the variation of the laser threshold. The laser output power, 
L, produced by a fixed current J is 


L=S(U-I;) (7) 
where S is the differential quantum efficiency of the laser. Then the 
output instability is defined as: 

== =5: (8) 

When using the ABC circuit the laser has an additional bias current 
I,, and in this case the power output is: 

L,=SU +I], —I;). (9) 

Then, according to Eq. (8), the instability of the laser driven by the ABC 


circuit is: 


U=SP=-8(1-F) (10) 


AUTOMATIC BIAS CONTROL FOR INJECTION LASERS = 1537 


—-—-— LIGHT 
VOLTAGE 


OUTPUT LIGHT IN mW 





LASER JUNCTION VOLTAGE (10mV/div) 





=-— — —-oe 4 —— — 0 
70 80 90 100 110 120 130 140 150 160 170 


LASER CURRENT IN mA 


Fig. 5—Laser junction voltage and output light as a function of the bias current and 
the temperature. 


Using Ea. (5) to calculate the derivative of J,, one obtains 


-S 
U; = ——- 11 
TAF1 oP 
The comparison of Eq. (11) with Eq. (8) indicates that, when the ABC 
circuit drives the laser, the instability of the laser output decreases 
(A + 1) times, as would be expected for a closed-loop system of 
gain A. 


IV. BIASING THE LASER BELOW THRESHOLD 


This analysis has assumed so far that, above threshold, V; is perfectly 
saturated and that V, vanishes. This is not the physical case, because 
there is always a minimum V,, called V,,, caused by the noise at the 
output of the operational amplifier and the lack of saturation of V; (see 
Fig. 5). This deviation from the ideal situation requires one to limit the 
value of A such that 


AXVe 
R > 





I, — Ibo > (12) 


otherwise the amplified noise will produce an J, > J;, and the circuit will 
overrespond to changes in V,. 

In order to be able to increase the value of A above that determined 
by eq. (12), a voltage AV is subtracted from V, such that AV > V,. 
Then, the feedback loop increases J, until V. = AV. AV ¥ O causes the 
laser to be biased below the threshold level. Figure 6 shows a diagram 
of the ABC similar to that of Fig. 4 but including AV. In this case, the new 
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Fig. 6—System representation of the modified ABC circuit. 


operating point of the laser is 


I,-Ipno, A AV 
I, =+— + ———_.. —. 1 
°*  At1 Ati R’ ae) 
(I, — Ino)R 


AV andA > 1 


then for A > 


ay (18a) 


I, * 
where AV > V,,. 
In eq. (18), AV is aconstant but R is a function of I (see Fig. 7); R is 
the derivative of V;, evaluated at a current below and near threshold. 
Because V; is a logarithmic function of J, 


dV; _ Vr 
= 14 
dq i a 


where V7 = nkT/q. The constant n characterizes the semiconductor 
junction, k is the Boltzmann’s constant, T is the junction temperature, 
and q is the electron charge. 

By an analysis similar to that used for eq. (11), we can find a corrected 
value of the instability, using eqs. (14), (13), (8), (9), and (10). This new 
value, called Us, considers the effect of having a AV ¥ 0: 


1 A AV 
UPS Ga a) 15 

2 AVI Ad Va 5) 
The minimum and limiting value of U2 is obtained by having an A> 1 
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Fig. 7—R as a function of the laser threshold. 


and A > V7/AV; then eq. (15) results: 


AV 

Us S Ve (16) 
According to eq. (16) the maximum improvement in the laser stability 
is determined by the ratio V7/AV. AV is caused by the nonsaturation 
of V;, V,,. Figure 5 shows L and V; as a function of J and T. At L = 6 mW 
and T = 0°C, V,, is less than 2 mV. Therefore one could make AV = 
2mV. Vr can be computed from the data in Fig. 7 using eq. (14), Vr = 
32 mV. For these values, one finds V7/AV = 16. According to eq. (15), 
V7/AV determines the improvement in the laser instability caused by 
the ABC circuit. 


V. ELECTRICAL CIRCUIT 


The different parts of the ABC circuit are described in this section. As 
is shown in Fig. 2, the circuit consists of an operational amplifier, a peak 
detector, and a current source. 

The operational amplifier has three functions: first, it determines V; 
by compensating for the voltage drop in the laser series resistance; sec- 
ond, it filters the dc component of the junction voltage which depends 
on the laser temperature; and third, it amplifies the ac component toa 
value that can be processed by the envelope detector. In general, the 
bandwidth of the operational amplifier should be larger than the 
bandwidth of the modulating signal. This is required by the expression 
Alm > I; — Ino, from eq. (5). But in certain cases, like that of video sig- 
nals, one may use the synchronization signal to control the laser instead 
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Fig. 8—Circuit to monitor the laser junction voltage. 


of using the full bandwidth of the signal. In other cases one may add a 
low frequency signal for the ABC circuit. 

Figure 8 shows a simplified portion of Fig. 2, with the two capacitors 
removed. The circuit was used to analyze the laser diode. It generates 
an output voltage V,, which can be expressed in terms of Vj, the laser 
current J, the laser series resistance R,, and other parameters of the 
circuit like the resistance R,, the potentiometer ratio K, and the am- 
plification A of the differential amplifier: 


V, = {Vj — K) + 17[R, — (R, + Ry) KA. (17) 


The circuit of Fig. 8, similar to a bridge circuit, compensates for the 
voltage drop across R, by having K set to 


Rs 


K=——*— 18 
R,+R, ee 
which results in 
AXR, 
eV 19 
ss R, +R, Mi m7) 


indicating that V, is proportional to Vj. Ifa large A is desired, then the 
circuit should include an offset control to shift the output voltage so V, 
is within the dynamic range of the amplifier. Figure 5 shows how the laser 
output light and V, vary with J in the vicinity of threshold and at dif- 
ferent temperatures of the laser. These data are useful to determine V,,; 
ie., for L = 6 mW and T = 0°C, V,, is less than 2 mV. 
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Fig. 9—Voltage at the output of the operational amplifier and laser output light as a 
function of the laser bias current at three different temperatures and without feedback 
connection. 


In the circuit shown in Fig. 2, C; and Cz remove the dc component of 
the input signal and the laser voltage, respectively; the potentiometer 
R, and the resistor R; compensate the voltage drop in R,. The feedback 
resistor R, determines the gain of the operational amplifier. Resistor R4 
sets the dc level of V,, compensates the voltage drop across the diode 
D, and the emitter-base voltage of Q, and determines AV. In the circuit 
of Fig. 2, the peak output voltage is given by 


meee! 
Re 


Figure 9 shows the open-loop output of the operational amplifier of 
Fig. 2 and the laser output light when the laser current is modulated by 


Va Xl: (20) 
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Fig. 10—Laser threshold and differential quantum efficiency as a function of the laser 
temperature. 


a sinusoidal signal, as a function of the laser bias J, at three different 
temperatures of the laser. The photographs clearly show that V, = 
V, ~ 0 for currents above threshold, and different laser tempera- 
tures. 

The peak detector that determines the maximum error voltage consists 
of a rectifier and a low-pass filter. Different types may be used, and a 
simple one is shown in Fig. 2 which consists of a half-wave rectifier (D1) 
and a low-pass filter formed by the capacitor C3 and the input resistance 
of the current source. 

The current source consists of an emitter-follower that converts the 
voltage from the peak detector into a current which is amplified by 
transistor Qo. For the circuit of Fig. 2, the output of the current source 
is 

I, — Ibo eee (21) 

Rs 
where £2 is the current amplification factor of Qo. The peak output V, 
of the operational amplifier A1 is given by eq. (20). According to Fig. 4, 
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one can define the current amplification A as 

Ty — Too _ B2X Ra XR 
Ie Rs X Rs 

At room temperature, J; = 120 mA, R = 0.252. Bo = 40, R, =20,R3 = 


100 2 and Rs = 200 2. One obtains an amplification A = 2500. This 
amplification satisfies the condition of eq. (13a). 


A=RXY= (22) 


Vi. DISCUSSION 


An electronic circuit to improve the stability of an injection laser was 
presented. Emphasis was placed on describing the fundamentals of the 
electronic feedback rather than comparing its performance and limita- 
tions with other methods of intensity control. 

The ABC circuit assumes that the laser junction voltage saturates 
above threshold, and it did not consider laser anomalies like kinks in the 
L —I curve nor changes in the differential quantum efficiency, S. 

The changes in S were neglected because they are less important than 
the changes in J; produced by temperature variation and age. This can 
be confirmed by looking at Fig. 10 which shows J; and S as a function 
of the laser heat-sink temperature. 

The circuit was operated with low-frequency signals below 100 kHz, 
and no effort was made to improve the frequency response of the am- 
plifier so the ABC circuit could be used to transmit analog or digital 
signals at frequencies larger than 100 kHz. 

The circuit described here may be useful to bias lasers near threshold 
during aging studies. 
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More on Rain Rate Distributions and 
Extreme Value Statistics 


By S. H. LIN 
(Manuscript received October 14, 1977) 


A new methodology is described for estimating the 5-minute rain 
rate distribution from yearly 5-minute maximum rain rate data and 
yearly accumulated rainfall data published by the National Climatic 
Center for U.S. locations. The method previously described gives the 
high rain rate portion of the distribution, whereas the extended meth- 
odology yields the complete distribution, which is assumed to be ap- 
proximately lognormal. The three parameters characterizing the log- 
normal distribution can be calculated by application of the theory of 
extreme value statistics. The calculated results agree well with the 
20-year data. The accuracy of the calculated results is limited by the 
instability of extreme rain rate data witha finite time base. Two-year 
rain rate data measured by a tipping bucket rain gauge at Palmetto, 
Georgia, are used to demonstrate that the time variation of rainfall 
process obeys a proportionate relationship, supporting the lognormal 
hypothesis. 


1. INTRODUCTION 


Reference 1 has described a methodology for calculating long-term 
distributions of high rain rates by applying the theory of extreme value 
statistics to the yearly maximum 5-minute rain rate data published by 
the National Climatic Center.2? The obtained high rain rate distribu- 
tions cover the range of interest to the engineering of terrestrial micro- 
wave radio links. However, for other applications, such as earth-satellite 
radio engineering, the rain rate distributions in the moderate and low 
rain rate ranges are also needed. This paper describes a methodology 
to obtain rain rate distributions covering the entire range (i.e., from below 
5 mm/hr to greater than 200 mm/hr). The rain rate distributions are 
assumed to be approximately lognormal. The three parameters char- 
acterizing the lognormal distribution can be calculated from the yearly 
maximum 5-minute rain rate data and the yearly total accumulated 
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Fig. 1—Binghamton, New York: Comparison of 20-year distribution of 5-minute rain 
rate calculated by extreme value theory and lognormal hypothesis (dashed line) with 
20-year data (solid line). 


rainfall data by applying the theory of extreme value statistics. The 
calculated results agree well with 20-year data as shown in Figs. 1 to 13 
and Figs. 17 to 20. 

Sections II and III describe the method and discuss the results. Section 
IV discusses characteristics of measured time variations of rain rates in 
support of the proportionate effect described by Aitchison and Brown.?° 
The proportionate variation of rain rates is simply another manifestation 
of the lognormality of rain rate statistics. 

In this paper, a “5-minute rain rate” corresponds to the average value 
of the randomly varying rain rate in a 5-minute interval and is calculated 
as AH/r where AH is the 5-minute accumulated depth of rainfall and 
7 = 5 minutes or 4/¢ hour is the rain gauge integration time. The meth- 
odology is also applicable to integration times other than 5 minutes. 


ll. EXTREME VALUE STATISTICS AND LOGNORMAL RAIN RATE 
DISTRIBUTION 


Many sets of rain rate data indicate that rain rate distributions can 
be closely approximated by the lognormal distribution (see Refs. 4 to 
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Fig. 2—Comparison for Trenton, New Jersey. 


9, 23, 24 and Figs. 1 to 13 and 17 to 20 in this paper)*: 


Inr — a 
V2 Sp 


where R is the randomly varying 5-minute point rain rate, erfc(~) de- 
notes the complementary error function, In(~) denotes natural loga- 
rithm, Sp is the standard deviation of In R during the raining time,* R,, 
in mm/hr is the median value of R during the raining time and Pp is the 
probability that rain will fall at the point where the rain rate R is mea- 
sured. Rain rate data usually emphasize high rain rate statistics with the 
result that the value of Po, and hence the total raining time per year, are 
not directly available. In the following, it is demonstrated that the values 
of Pp, Rm and Sp, and hence the entire distribution P(R = r), can be 
determined from the yearly maximum 5-minute rain rate data and the 
yearly total accumulated rainfall data. 

Let W denote the long-term average value of the yearly accumulated 
depth of rainfall. The relationship between W and the parameters in 


P(R =r) = Po 5 exfe | (1) 


* Figures 14, 15, 16, 21, and 22 are discussed later in Sections IJ and III. 
+ Excluding snowfall. 
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Fig. 3—Comparison for Harrisburg, Pennsylvania. 


eq. (1) is 
W = (R) X total raining time/year 
= (R) X Po X (8760 hours/year) 
= R,, X eSr’/2 X Po X (8760 hours/year) (2) 
where 
(R) = Rp X e8r?/2 (3)* 


is the mean value of R during the raining time.* Long-term (230 years) 
data on W for U.S. locations can be found in Refs. 2 and 11. 

Let R; denote the yearly maximum 5-minute rain rate which varies 
from year to year. The distribution of R, is! 


P(R, =r) =1-e-&”) (4) 
where 
y =a(Inr — U) (5) 


is called the reduced variate, a and U are scale and location parameters 


* This relationship among (R), Rm and Sp holds if R is lognormal.*!° 
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Fig. 4—Comparison for Pittsburgh, Pennsylvania. 


respectively. Notice that the lognormal rain rate distribution (1) is 
uniquely determined by the three parameters Po, R,, and Sp; whereas 
the distribution (4) of the yearly maximum 5-minute rain rate R, is 
uniquely determined by the two parameters a and U. Gumbel!2:8;22 has 
given the following approximate relationships among a, U and the parent 
distribution (1): 





U-IlnR,, 1 
Pa] Ee 1 6 
( Sr ) Po: N (6) 

Po-N U-InR,, 
= — 6 ( 7 
es ( Sr ) op 

where 
U-InR, 1 U-InR,, 

Sf = 1 = er 8 
ge) as | ® 


is the standard unit normal distribution function, 
d 
o(z) =— &(z) (9) 
dz 
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Fig. 5—Comparison for Lynchburg, Virginia. 


is the normal probability density function, and 
N = total number of 5-minute intervals per year 
= (525600 minutes/year)/5 minutes 
= 105120. (10) 
From eqs. (4) and (5) it is easily shown!*18 that U is the most probable 


value (i.e., the mode) of In R; where R, is the randomly varying yearly 
maximum 5-minute rain rate. Let us define 


R, =e. (11) 


Equation (6) states that, on long-term average, the randomly varying 
rain rate R will exceed R, by approximately 5 minutes per year.* 
Equation (7) further specifies the slope (i.e., the derivative or probability 
density) of the rain rate distribution at R = R,,. Solving eqs. (6) and (7) 


* From eqs. (1) and (6), it is easily shown that 


. _, (U-InRm\| _ 1 


Multiplying this probability by the total time per year yields 5 minutes per year. 
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Fig. 6—Comparison for Rochester, Minnesota. 








yields 
Py-N ‘ 1 
= : - ad Z 
Sr a ? E (1 Po =n) (12) 
and 
1 
= —~99,.6-1/1—- 
Rm = exp lu Spr: ® (1 eo (13) 


where &~!(~) denotes the inverse normal probability function. 
Reference 1 has given a set of formulas for calculating the parameters 
a and U from the yearly maximum 5-minute rain rate data. For com- 
pleteness, this set of formulas is included in Appendix A. Knowing the 
values of W, a and U allows us to solve* the three equations (2), (12), and 
(13) for the three unknowns Po, R,, and Spr. Substituting these three 
parameters into eq. (1) then yields the entire rain rate distribution. 
For example, Table I lists the yearly maximum 5-minute rain rate R, 
measured at Binghamton, New York, for the 20-year period from 1953 
to 1972.2 Applying the formulas in Appendix A to the data in Table I 


* These transcendental equations are solved numerically by a computer iteration pro- 
cess. 
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Table | — Yearly maximum 5-minute rain rates at Binghamton, 
New York 


Yearly maximum 
5-minute rain 


Year rate, mm/hr 
1953 103.63 
1954 100.58 
1955 161.54 
1956 152.40 
1957 91.44 
1958 103.63 
1959 201.17 
1960 243.84 
1961 112.78 
1962 67.06 
1963 115.82 
1964 188.98 
1965 91.44 
1966 134.11 
1967 85.34 
1968 91.44 
1969 106.68 
1970 91.44 
1971 97.54 
1972 76.20 
yields 
a = 3.224 
U = 4.5736. 


The 30-year (1941-1970) average value of W at Binghamton? is 
W = 762 mm/year. 
Substituting this set of W, a and U into eqs. (2), (12), and (13) yields 
Po = 0.018 (i.e., 1.8 percent), 
Rm = 2.631 mm/hr, 
Spr = 1.1015 nepers. 


The lognormal distribution (1) of the 5-minute rain rates calculated from 
this set of Po, Rm and Sp agrees closely with the 20-year data! as dis- 
played in Fig. 1. Similarly, Figs. 2 to 13 show the close agreement between 
the calculated result and the 20-year data at 12 other locations. 
However, high rain rate statistics require a very long time base to yield 
stable results. The sensitivity of the high rain rate distribution with re- 
spect to time base measured at Newark, New Jersey, is shown in Fig. 14. 
It is seen that increasing the time base from 19 years to 21 years signif- 
icantly alters the distribution for rain rate beyond 150 mm/hr. J. W. 
King! and J. Xanthakis!© have presented approximately 100 years of 
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Fig. 7—Comparison for Moline, Illinois. 
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Fig. 8—Comparison for International Falls, Minnesota. 
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Fig. 9—Comparison for Omaha, Nebraska. 
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Fig. 10—Comparison for Oklahoma City, Oklahoma. 
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Fig. 11—Comparison for Green Bay, Wisconsin. 
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Fig. 12—Comparison for Peoria, Illinois. 
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Fig. 14—The sensitivity of rain rate distribution with respect to time base. 
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Fig. 15—Comparison for Scranton, Pennsylvania. 
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Fig. 16—Comparison for Lihue, Hawaii. 
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rainfall data to show that the variations of annual accumulated rainfall 
from year to year is correlated with the 11-year cyclic variations of 
sunspot numbers. R. J. Talbot and D. M. Butler!” have discussed the 
climatic effects during passage of the solar system through nonuniform 
interstellar dust clouds. Therefore, rainfall activity is influenced not only 
by many terrestrial environmental factors but also possibly by extra- 
terrestrial sources. The required time base for stable rainfall statistics 
may be longer than 20 years. Since the parameters of the lognormal 
distribution are estimated from the yearly maximum rain rate data, the 
instability noted in Fig. 14 limits the accuracy of the calculated results.* 
Figures 15 and 16 give two examples of the effects of unstable high rain 
rate data on the estimated rain rate distributions. 


ill. FIFTY-YEAR DISTRIBUTIONS 


Section IV of Ref. 1 describes a set of formulas for calculating the 
parameters a and U from rainfall intensity-duration-frequency curves 
for U.S. locations published by the Weather Bureau.? These curves are 
derived by the Gumbel method!18 using the theory of extreme value 
statistics and are based on approximately 50 years (1900-1950) of rainfall 
data. From this data source, we need only the following three numbers 
for a given location to calculate a and U: 


M = the number of years of rainfall data from which rainfall- 
intensity-duration frequency curves are derived, 

Tq = the extreme rain rate with 2-year return period, i.e., the rain 
rate which is exceeded once in 2 years, on the average, by the 
yearly maximum 5-minute rain rates, 

r, = the extreme rain rate with 10-year return period, i.e., the rain 
rate which is exceeded once in 10 years, on the average, by the 
yearly maximum 5-minute rain rates. 


Therefore, in principle, long term distribution (1) of 5-minute rain 
rates for U.S. locations can easily be obtained by this method. The only 
input required are the four parameters W, M, r, and rp for each location 
read from Refs. 2 and 3. 

For example, for San Francisco, California, the four numbers are 


Ww 


115 mm/year 


M = 48 years (1903-1950) 
rq = 1.9 inches/hr = 48.3 mm/hr 
ry = 3.05 inches/hr = 77.5 mm/hr. 


* The accuracy of the calculated results for the very low rain rate region (i.e., < 10 
mm/hr) may be also limited because the parameters Po, R,, and Sp are estimated from 
the extreme, high rain rate data. 
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Fig. 17—Comparison of 48-year (1903-1950) distribution of 5-minute rain rate calculated 
by extreme value theory and lognormal hypothesis (dashed line) with 20-year (1953-1972) 
data (solid line) at San Francisco, California. 


The formulas for calculating a and U are given in Appendix B for com- 
pleteness. By substituting these values of M, rz and r, into eqs. (26) to 
(31) we obtain 


a = 3.6297 
U = 3.7786. 
Substituting this set of W, a and U into eas. (2), (12) and (13) yields 
Po = 0.0016 (i.e., 0.16 percent) 
Rm = 6.238 mm/hr 
Spr = 0.7771 neper. 


Figure 17 shows that the calculated lognormal distribution of 5-minute 
rain rates for the 48-year period (1903-1950) is reasonably close to the 
20-year data (1953-1972). Similarly, Figs. 18, 19, and 20 show the 
agreement between calculated results (=43 years) and the 20-year data. 
On the other hand, Figs. 21 and 22 give two examples of appreciable 
differences due to the instability of high rain rate data. 
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Fig. 18—Comparison of 48-year (1903-1906, 1908-1951) distribution of 5-minute rain. 
rate calculated by extreme value theory and lognormal hypothesis (dashed line) with 
20-year (1953-1972) data (solid line) at Valentine, Nebraska. 


103 


LANDER, WYOMING 


10! 


N 
\ 


10° LOGNORMALY 
46 YEARS ™~ 


20 YEARS DATA 
(1953-1972) 





AVERAGE NUMBER OF MINUTES PER YEAR RAIN RATE EXCEEDS ABSCISSA 


107 
0 20 40 60 80 100 120 140 160 180 200 220 240 260 280 300 


5-MINUTE POINT RAIN RATE IN MM/HR 


Fig. 19—Comparison of 46-year (1905-1950) distribution of 5-minute rain rate calculated 
by extreme value theory and lognormal hypothesis (dashed line) with 20-year (1953-1972) 
data (solid line) at Lander, Wyoming. 
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Fig. 20—Comparison of 43-year (1908-1950) distribution of 5-minute rain rate calculated 
by extreme value theory and lognormal hypothesis (dashed line) with 20-year (1953-1972) 
data (solid line) at Sheridan, Wyoming. 
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Fig. 21—Comparison of 47-year (1903-1907, 1909-1920, 1922-1951) distribution of 
5-minute rain rate calculated by extreme value theory and lognormal hypothesis (dashed 
line) with 20-year (1953-1972) data (solid line) at Salt Lake City, Utah. 
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Fig. 22—Comparison of 49-year (1903-1951) distribution of 5-minute rain rate calculated 
by extreme value theory and lognormal hypothesis (dashed line) with 20-year (1953-1972) 
data (solid line) at Scranton, Pennsylvania. 


1V. PROPORTIONATE EFFECT AND LOGNORMAL RAIN RATE 
DISTRIBUTION 

The rainfall process is influenced by many environmental parameters. 
An important question is whether the environmental parameters affect 
the rain rate in a proportional fashion or in an additive fashion. It is well 
known>-10.21 that a proportional fashion leads to a lognormal distribution 
whereas an additive fashion leads to a normal distribution. The following 
rain rate data will shed some light on this question. 

Rain rate data measured in Illinois,!}® New Jersey,!9 and Canada?° 
indicate that the short term mean rain rate (R), and the deviations, AR, 
from the short term mean (R), appear to be correlated. The subscript 
s in this section denotes “short term” mean value. These data indicate 
that the magnitude of the deviations, AR, tends to increase with the short 
term mean (R),. In the following, we present 2 years of rain rate data 
measured by a tipping bucket rain gauge at Palmetto, Georgia, to confirm 
this correlation between AR and (R),. 

Figure 23 displays the time-varying rain rate, R(t), in two 1-hour pe- 
riods measured by the tipping bucket rain gauge at Palmetto, Georgia. 
In Fig. 23a, the hourly mean rain rate (R), is 12.6 mm/hr and the hourly 
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a Fig. 23—Time-varying rain rate measured by a tipping bucket rain gauge at Palmetto, 
eorgia. 


mean deviation (|AR|), is 8.8 mm/hr where 
[AR(¢)| = |R(t) — (R)s| (14) 


In Fig. 23b, the values of (RF), and (| AR|), are 17 and 13.4 mm/hr, re- 
spectively. Two years of rain rate data at Palmetto have been processed 
in this fashion and all the hourly (R), and (|AR| ), pairs are plotted in 
Fig. 24. It is seen that (R), and (|AR|), are indeed correlated and the 
average relationship is approximately a straight line with a 45 degree 
slope on the log X log graph paper. To examine this proportional rela- 
tionship more closely, let 


X(t) = n R(t), | (15) 
| AX(t)] = |X(é) — (X)s| (16) 
The relationship between the hourly mean value (X ), and the hourly 
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Fig. 24 Two-year data on correlation between hourly mean rain rate, (R)s, and hourly 
ete deviation, (|AR|);, measured by the tipping bucket rain gauge at Palmetto, 
eorgia. : 


mean deviation (|AX|), processed from the same 2-year rain rate data 
are plotted in Fig. 25. It is seen that (|AX|), is practically independent 
of (X),. In Figs. 23 to 25, we use 1-hour period for short-term mean only 
as an example. The 2-year data were processed by several different 
“short-term periods” ranging from 5 minutes to 1 hour and showed es- 
sentially the same correlation between AR and R. Figures 24 and 25 in- 
dicate that AR is approximately linearly proportional to R: 


AR=h-R (17)* 


where h is a proportional parameter. The scattering of the data in Fig. 
24 and the random variations of AR in Fig. 23 indicate that the propor- 
tional parameter, h, is not a constant, but is a time-varying random 
variable. Equation (17) can be interpreted in that the change, AR, in the 
rain rate is proportional to the product of the rain rate R and the in- 
tensity of the cause, h. In other words, the environmental parameters 
affect the rain rate in a proportional fashion. Therefore, the data of Figs. 
24 and 25 are another manifestation of the lognormal rainfall process 
and support the lognormal hypothesis (1). Readers interested in the 


* Equations (15) and (17) imply that AX is independent of X and is consistent with the 
data in Fig. 25. 
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Fig. 25—Two-year data measured by the tipping bucket rain gauge at Palmetto, Georgia, 
demonstrating that the hourly mean deviation, {|AX|),, and the hourly mean value, (X), 
are uncorrelated, where X = In R. 


derivation of the lognormal distribution from the proportionate rela- 
tionship (17) are referred to Refs. 10 and 21. 


V. CONCLUSION 


A new method has been described for calculation of 5-minute rain rate 
distributions from yearly maximum 5-minute rain rate data and yearly 
total accumulated rainfall data which are available from National Cli- 
matic Center?:3 for U.S. locations. By applying the theory of extreme 
value statistics and the lognormal hypothesis, the obtained rain rate 
distribution covers the entire range of rain rates (i.e., from below 5 mm/hr 
to greater than 200 mm/hr) for wide application. The calculated results 
agree well with long term (20 to 50 years) data as shown in Figs. 1 to 13 
and 17 to 20. The accuracy of the calculated results is limited by the in- 
stability of the high rain rate distribution with finite time base. 
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Formulas for Calculating Extreme Value Parameters a and U from Yearly 
Maximum Five-Minute Rain Rates 


Let R1V),7 = 1,2,3,...,M be the measured yearly maximum 5-minute 
rain rate in M years of measurements, and let 


x1) = In [RiGQ)] (18) 
The formulas for calculating a and U are: 


Oz 








a=? (19) 
Ox 
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= Zz 
U= xy7 7 (20) 
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x1 = Mo x19 
is the sample mean of xj, 
1 M ay staal 
o, = Frese [x1G) — x1] | (22) 
is the sample standard deviation of x1, 
J 
= —] = | ? 23 
2o)= =n (-Ing) es 
z= ¥ 2()) (24) 
= 2 ; 
2 Mn 
and 
_j 1 M ) — x2 1/2 (25) 
= aga EEO 7] 
APPENDIX B 


Formulas for Calculating a and U from Rainfall Intensity-Duration-Frequency 
Curves 


A= Ao * Oz *—> (26) 
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1 Zz T 
U=U.+— —-—-—=}], 
Qo E Oz a (27) 
Ag —A 
»=— se F" (28) 
Inr, —Inrz 
and 
A, Inr, — Ap Inrg 
0S 
Aq — Ap (29) 
where 
A, = —lIn [in Aa , (30) 
Q, | 
Ap = —In | n —— ]> 1 
i n | n Ores (31) 
Qa = 2 (years), (32) 
Q» = 10 (years), (33) 
y = Euler’s constant ~ 0.5772, (34) 


Z and o, are defined by eas. (24) and (25). 
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The general problem ts considered of calculating the voltage when 
a sinusoidal current generator is connected to a parallel combination 
of a linear and a nonlinear resistance load. A practical algorithm is 
described for computing any desired moment and any Fourier com- 
ponent of voltage. An alternative approximate treatment is also pre- 
sented which avoids numerical integrations and is valid when the 
nonlinear characteristic is rapidly varying. Both methods are applied 
to a photovoltaic optical telephone receiver employing a silicon nxp- 
photodiode and a conventional ring armature telephone receiver cou- 
pled by a transformer. Harmonic distortion is presented for several 
illustrative cases. A clipping level is defined for the receiver, and it is 
proposed that the receiver clipping level should be matched to the 
clipping level of the analog optical channel bringing the signal. On the 
basis of this principle a simple procedure is given, along with the nec- 
essary curves, for determining the required optical power at the source 
and the optimum transformer ratio for any value of transmission loss 
between source and receiver. An illustrative example is given for an 
analog dynamic range of 18 dB that requires a peak source power in the 
lightguide of 0.9 mW. This type of receiver may find limited application 
if lightguides ever serve customers directly. 


l. INTRODUCTION 


There is now strong indication!” that optical transmission using 
lightguides? and optical cables is technologically approaching a readiness 
for use in telecommunications. While we are not here suggesting that 
any extensive application of optical telephones is foreseeable, we have 
nevertheless found the prospect of limited special applications suffi- 
ciently interesting to undertake a study of optical telephone receivers 
from a device point of view. The receiver is only one of a number of de- 
vices, some of which perhaps have not even been invented yet, that would 
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be required in an optical telephone. In addition, further devices would 
be required in the electrical-optical interface between the lightguides 
and the metallic network. The receiver, however, determines one key 
property of the system, the optical power required at the interface to 
transmit speech to the human ear with an acceptable volume and qual- 
ity. 

We presuppose that an optical telephone receiver is required to con- 
vert analog-modulated light power to sound pressure at the ear with no 
other power available. There are two mechanisms that might be em- 
ployed to do this using analog modulation: the optoacoustic effect in 
which sound is directly produced when power-modulated light is ab- 
sorbed, and the photovoltaic effect in which an intermediate electrical 
signal is produced which produces sound by way of an electrical ear- 
phone. We have previously completed a theoretical+° and experimental® 
study of optoacoustic receivers in which it was necessary to solve a variety 
of linear acoustical problems to establish the feasibility of the device and 
to obtain its response. Nonlinear distortion was not considered and is 
not believed to be very important for optoacoustic receivers. Optimi- 
zation in that case is a matter of maximizing the response subject to the 
requirement of a flat response over the telephone voice band, 300- 
3300 Hz. 

Subsequently we have attempted to define an optimum photovoltaic 
receiver in a similar way on the basis of maximizing the linear response. 
It might at first appear that this is a simple problem, because a suitable 
photodiode and earphone are already available, the required frequency 
response has already been engineered into the earphone, and one might 
expect that it is only required to match the impedances of the photodiode 
and the earphone by a simple two-port network (e.g., a transformer). 
However, we have now concluded that no optimum of this type exists 
for the photovoltaic receiver, and a different principle of optimization 
is required which is based on the nonlinearity of the photodiode and the 
quality of speech reproduction required in the system. Thus it has not 
been possible to keep system concepts completely out of the discus- 
sion. 

We propose here a very simple principle of optimization, which we call 
“quality matching,” which leads to important conclusions about the 
optical power at the interface, the dynamic range of the system, sound 
levels in the system, and of course, the receiver design. 

We base our discussion on the circuit of Fig. 1. Modulated light power 
u is conducted by a lightguide to the photodiode, which may be regarded 
as a current generator g in parallel with a junction current j(v). The 
photodiode assumed is a specific silicon ntap* structure’ designed and 
packaged for lightguide use with a light-sensitive area of diameter 
80 wm and a quantum efficiency at 900 nm of about 7 = 0.8. It is ideal for 
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PHOTODIODE EARPHONE 
(a,B,Y) (p,8) 


Fig. 1—Circuit of photovoltaic receiver. The earphone resistance p is transformed 
tor. 


our purpose because of its low series resistance (~ 1 Q). The earphone 
is the ring armature telephone receiver® which provides good sensitivity 
with essentially flat response over the band 300-3300 Hz. The photo- 
diode and earphone are coupled by an ideal transformer having a turn 
ratio n adjustable by virtue of primary taps. The size and cost of the 
transformer would be approximately proportional to the maximum value 
of n. The effective load resistance seen by the photodiode is zero at dc 
and r = n“p at all signal frequencies, p being the earphone resistance. 
The value of n is to be selected at the time of installation in accordance 
with our optimizing principle. 

The first result to be described in this paper is the nonlinear analysis 
itself, in Section II. Mathematically our problem is the following: Find 
the periodic voltage response p(t) to a source current g; cos wt in Fig. 1 
assuming r is linear and independent of frequency (except dc), j(v) is 
nonlinear and monotonic, and p(t) has zero average value. What makes 
the problem awkward to treat by textbook methods’ is the restraint on 
the average value. The analysis, so far as we know, is not covered in texts 
on nonlinear circuits, and may be applicable to a variety of situations. 
In Section III is given an approximate method called the clipping model 
which we have found quite reliable and especially appropriate for the 
photovoltaic receiver. 

The main body of the paper consists of Sections IV, V, VI, and VII, 
devoted to the sensitivity, harmonic distortion, clipping level, and quality 
matching of the receiver, respectively. The sensitivity is an inverse 
measure of response (the smaller the better!) defined here in the same 
way as in our previous work as the amplitude of sinusoidal (power) 
modulation of u required to produce at the ear the average speech power 
level (81 dB SPL) found in the telephone network surveys. It is interesting 
to note that the minimum sensitivity is achieved for r ~ 3 X 10° Q, which 
is considerably smaller than the small signal resistance of the photodiode, 
=~ 8 X 109 0. This shows the essential importance of the nonlinear analysis 
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presented here since a linear analysis would lead to the equality of these 
two resistances. The minimum sensitivity is not, in general, the optimum 
because of nonlinear distortion. 

We have made extensive calculations of the second, third and fourth 
harmonic distortions and the total harmonic distortion. Some curves 
of total harmonic distortion for a few selected cases are presented in 
Section V. We have found it difficult, however, to draw concrete con- 
clusions from a consideration of the harmonic distortion that could be 
used as the basis for an optimizing principle. Rather, we have turned to 
clipping as the most convenient, relevant, and useful way of specifying 
the nonlinear distortion. 

The clipping model, Section III, is based on the assumption that the 
distortion of the waveform is an abrupt one-sided clipping of the peak. 
This assumption is shown to be an adequate modeling of the nonlinear 
effects of an exponential junction characteristic. It is shown in Section 
VI that a clipping level can be defined which is analogous (except for 
being one-sided) to the clipping level of an analog-modulation channel. 
Our optimizing principle, ‘quality matching,” then follows in Section 
VII in an obvious way, namely that the clipping levels of the receiver and 
the analog light channel feeding the receiver should be set equal to each 
other. The receiver then has the minimum sensitivity consistent with 
the requirement that the quality of the channel not be degraded by the 
receiver. This means that the system could operate at the minimum 
power consistent with a given dynamic range. By using the curves given 
in Section VII, quality matching can be carried out to determine for any 
desired dynamic range the optical power required and the correct 
transformer ratio for the receiver needed for the particular value of the 
transmission loss at that receiver. 

A summary with discussion and conclusions is given in Section VIII. 
This is followed by two appendixes containing useful background ma- 
terial on speech quality and the electrical-optical interface. 


Il. NONLINEAR ANALYSIS 


In the circuit of Fig. 1 ignore the light, the transformer, and the ear- 
phone; assume that the effective load resistance r is constant at all signal 
frequencies of interest and zero at dc. Let the current generator be 


& = 80 + 81089 


6= wt 
80 2 81 2 0. (1) 
Define averages over time by 
1 T 
(FO0=>- f fOd0= fo. (2) 
Qa =r 
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The equation of the circuit is 


v —TJo = 1rgi cos 0 — rj(v), (3) 
which implies the relation 
vo = 0. (4) 
Concerning j(v) we only assume 
Jv =dj/dv2>0,  j(0) =0 (5) 


which rules out negative-resistance instabilities. We call v(@) the solution 
and 6(v) the inverse solution. 
Define the parameters 


w(r)=a, (0) =, (6) 


and 
g2=5[(b £4) + rG(d) + ja). (7) 


We can assume p and 0 are restricted to the domains 
a<v<b, 0<0<7c7 (8) 


and the inverse solution 4(v) is unique. It follows that 


1 7 
my zat— f 8(v) dv (9) 
Tv a 
and 
, ' 1 b. 
jo = ila) + — j. JO) dv. (10) 
From (3) we obtain the inverse solution 
6(v) = cos 1[(v + rj(v) — q+)/q-] (11) 
and the implicit relations which determine a,b 
q+/Jo =r (12) 
(q-/q+)Jo = &1- (13) 


Our algorithm proceeds as follows: (i) choose initial values for a,b; 
(ii) compute qs. from (7) and J, from the integral (10); (iz) test (12) and 
(13); (tv) iterate this procedure with adjusted values of a,b until the 
desired precision is achieved; (v) the inverse solution is now given by (11) 
with the final values of a,b; (vi) as a check, evaluate v, from (9) and test 
(4). Since (9) and (10) involve numerical integrations, this algorithm 
requires a large computer. 
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A function of v such as j(v) can be expanded in a Fourier cosine se- 
ries 


i(() = jo +S jx €08 0 (14) 
Jk = 2(j((8)) cos RO) o. (15) 


Write (15) in the form 
2 b. 
jp=— § j(v) sin k0 dv. (16) 
wk Ja 


It follows that the Fourier coefficients of v are 


2 b 
= f Sneods. (17) 
ak a 


From (8) it follows that vz is also given by 
Vk = q—6r1 — TJk (k = 1,2,...), (18) 


which is preferable to (17) for numerical work because it automatically 
becomes exact as J, — 0. The moments of v(@) can be obtained in the 
same way as (10) 


(v(0)®)g =ak+ id f vk-l @(y)dp, (19) 
wT Ja 


Define a normalized waveform 


(0) = (v(8) — q+)/q- (20) 


which has the property of reducing to the input waveform cos @ whenever 
J(v) = 0. If 7(v) is a rapidly increasing function of v (such as the expo- 
nential characteristic of a junction), it may be that the nonlinearity of 
J(v) can be neglected over part of the cycle while over the rest of the cycle 
the nonlinearity effectively clamps v at a constant upper limit. This is 
one-sided abrupt clipping. It is customary to define the clipping fac- 
tor ® in terms of the ratio of the true analog peak to the clipped peak as 
follows 


& = —20 log dmaxs (21) 
where from (20) 
dmax = 6(0) = (6 — q+)/q-. (22) 
The power delivered to the load at frequency kw is 
Dr = vp/2r. (23) 
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A common way of specifying nonlinear distortion is in terms of the ra- 
tios 


dr - Pr/P1 (k = 2,3, o- .) (24) 
and 
d= ¥ dy = (20 )ol#) ~ 1. (25) 


We define the kth harmonic distortion by 
Dy = 10 log dp, (26) 
and the total harmonic distortion by 
D = 10 log d. (27) 


ll. CLIPPING MODEL 


Assume that the distortion may be represented as abrupt clipping of 
the positive peaks. Then 


v(0) = w cos (6,7) — w (cos (8,7)) 9, (28) 
where w and 7 are parameters to be determined, and 


7, O0<0<7 


On =|) TSOK<0 2) 
A simple calculation gives 
(cos (6,7))9 = (7 cos 7 — sin r)/z. (30) 
The clipping factor is 
@ = —20 log (cos 7). (31) 


The Fourier coefficients are now obtained without numerical integration 
from the relations 


vy = (w/a)(a — 7 + 5 sin 27) 


=wt... (32) 
vp = [2w/a(k? — 1)] (cos kz sin t — R71 cos 7 sin kr) (k = 2,38,...) 
= —(2w/38r)72 + ...(k < 774), (33) 


and the total distortion from 
1 
(v2) 9 = (w2/2r) (a + 7 cos 27 — 5 sin 27) — w?(cos (0,7) )3 
= (v7/2)[1 + (4/15r)75 +... (34) 
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For the determination of w,7 consider (11) in the form 
v(0) = q-cos 6 + qy. — rj(v). (35) 
Comparing (28) with (35) shows that 


q-=w, q+ = —w(cos (6,7))o, (36) 
and that j(v) is being approximated by 
Vc ee mee (37) 
The normalized waveform defined in (20) is being approximated by 
(0) = cos (6,7). (38) 


When the model is valid, it is also valid to neglect j(a) in (7); it then 
follows from (7), (12) and (13) that w,r must obey the relations 


w=rgy (39) 
J(b)(1 — cos 7)~* = g1, (40) 

where 
b = w[cos 7 + (sin 7 — 7 cos 7)/z]. (41) 


Equation (40) requires that (37) be exact at 0 = 0. 

The clipping model is suitable for use with a minicomputer since no 
numerical integrations are required. If j(b) is easy to evaluate, the iter- 
ation of (40) is easy to do by trial and error. The accuracy of the model 
is best determined by comparison with the results calculated by the 
method of Section II. However, it is possible to obtain a corrected 
waveform in the region of clipping by solving the implicit relation 


J(v(9)) = (cos 8 ~ cos 7r)g1 (0<A <7). (42) 
obtained from (37). 


IV. RECEIVER SENSITIVITY 
In Fig. 1 let the light power be 
Uu = Up + U; Cos 8 
= wl, Up 2UyS 0. (43) 
The photodiode assumed here is a specific n*tap* silicon unit having 
the dark current characteristic shown in Fig. 2. It is typical of a class of 
photodetectors developed by H. Melchior’ for lightguide applications 


not requiring the sensitivity of avalanche photodiodes.!® It has a quan- 
tum efficiency at 900 nm of 7 = 0.8 and a very low series resistance 
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Fig. 2—Dark current-voltage characteristic of the photodiode. Parameters of the 
characteristic (44) for regions (i) and (ii) are listed in Table I. 


=~ 1 Q. The characteristic of Fig. 2 has two exponential regions of the 
form 


J(v) = a(e% — 1). (44) 
The current generator g in Fig. 1 can be written 
g= yu. (45) 


The parameters a,G,7 of the photodiode are listed in Table I. 

The earphone is the ring armature telephone receiver® which provides 
an essentially flat response over the voice band, 300-3300 Hz. We shall 
specify this earphone by a resistance p and power sensitivity 6, both as- 
sumed independent of frequency over the band. The response may be 


Table | — Photodiode parameters, 
n = 0.80, y = en/hv = 0.58 V—! (at 900 nm) 
(i) v < 0.387 V (ii) v > 0.87 V 
a(A) 3.1 X 10-12 3.8 X 1079 
B(V-}) 38.70 19.35 
(a8)~} (Q) 8.3 X 109 1.36 X 107 
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characterized by the relation 
SPL = 81+10log(p/5) = (dB), (46) 


where SPL is the sound pressure level produced in a closed volume of 
6 cm? and p is the inband power available from a matched generator of 
resistance p. Equation (46) shows that, when p equals the power sensi- 
tivity 6, the telephone receiver produces a speech pressure level (SPL) 
of 81 dB, which is the average level found in the telephone network by 
surveys.!1,12,13 Representative values for p,6 may be taken as 


p = 1280 
6=2.5 X 107-7 W. (47) 


The transformer is assumed to be ideal over the band with primary 
taps to give a variable turns ratio n. The dc resistance is assumed neg- 
ligible; this reduces the dc bias on the junction to zero, the most advan- 
tageous value. The inductance of the secondary must exceed 0.07 H 
determined by p and the low-frequency cutoff. It follows that the size, 
weight, and cost of the transformer would be approximately proportional 
to the maximum required value of n. In the following we specify the re- 
ceiver by n; the effective ac load resistance of the photodiode is 


r=n2p. (48) 


The sensitivity of a receiver of given n will be defined as the value of 
u , which produces SPL = 81 dB at the ear; thus the sensitivity s is de- 
fined by the relations 


s=U, Pi=6 (49) 


where p; is defined by (23). There is a range of optical powers u1 (and 
hence a range of sensitivities s) allowed by (49) because an increasing 
turns ratio n can be used to compensate (until nonlinearities become 
significant) for a decreasing optical power u,. Figure 3 shows s plotted 
versus n. At the minimum we find by the method of Section II the 
values 


n = 48, r = 0.29 MQ (s = x) (50) 
and 


X = Smin = 2.6 uW. (51) 


The clipping model of Section III gives the same asymptotic straight line 
and the minimum value 2.5 wW. 

The asymptotic straight line represents the sensitivity § of the 
distortion-free receiver that would result if we could take j(v) = 0; the 
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SENSITIVITY s(W) 


X= 2.6 x 10-5w 





1 10 100 
TURNS RATIO n 


_ Fig. 3—Sensitivity s defined in (49) versus turns ratio n of transformer. The straight 
line shows the distortion-free receiver (52). 


equation of the line is 


— 
= — 


2 2 


Qrg2 2p? , 
=e =1* "532 — (distortion—free receiver). (52) 


V. RECEIVER HARMONIC DISTORTION 
We define the amplitude level U, by 


U, = 20 log (u;/x). (53) 


The need for the factor of 20 in this equation in comparison to the factor 
of 10 in (46) results from (7) the photoelectric effect in the photodiode 
by which the electrical signal power produced is proportional to the 
square of the optical power modulation uw, and (ii) the desire to express 
U, on the same logarithmic scale as SPL. In analogy to the present me- 
tallic telephone network we assume that the level reaching any receiver 
can be regarded as a random variable with a distribution!!!2:8 that is 
approximately normal with a variance of 7.8 dB. We define the dynamic 
range I of the optical channel 


r=U, — (Ui)N, (54) 
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DISTORTION DIN DECIBELS 





-10 —5 0 5 10 15 20 
INPUT LEVEL U,-S IN DECIBELS 


Fig. 4—Total harmonic distortion D defined in (27) versus input signal level U,; defined 
in (538) relative to sensitivity level S defined in (56) for two values of n. Various sound 
pressure levels SPL are indicated; for all n, U; = S corresponds to SPL = 81 dB. Points (7) 
and (ii) are chosen for waveform examination in Fig. 5. 


where (U;)y is the average over the level distribution N (see Appendix 
A), and 


U, = 20 log (u,/x) (55) 


is the clipping level of the channel. This will always be a bottom-side 
clipping level; we will also assume for simplicity that it is a top-side 
clipping level. We assume I is a system constant maintained by the 
electrical-optical interfaces, whereas U, and (U1)y fall off with the 
transmission distance x of the particular lightguide. The dynamic range 
determines the probability that clipping will not occur, which we here 
call the quality. A more general discussion of I and speech quality is 
given in Appendix A. 

Harmonic distortion Do2,D3, ... and total harmonic distortion D are 
defined in (26) and (27) respectively. These quantities are functions of 
U, and are only meaningful up to U; = U, = I. + (U1) Nn. We define a 
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WAVEFORM @(6) 





0 0.177 0.29 0.37 0.4 7 0.577 
PHASE §, RADIANS 


Fig. 5—Normalized waveform (20) for points (¢) and (ii) of Fig. 4 for half of fundamental 
range 4 6. Fa clipping model approximation is shown dashed and the parameter 7 is in- 
icated on (iz 


sensitivity level 
S = 20 log (s/x) 20 (56) 


with a similar definition for S of the distortion-free receiver (52). For 
our calculations we have chosen to consider the receiver at the reference 
point x defined by 


(U,(X))n =S, (57) 
or the distortion-free receiver at the reference point xpr defined by 
(Ui(Xpr))n = 8. (58) 


Since we will presently conclude that significant distortion in the receiver 
will be avoided in practice, the distinction between x and Xpr need not 
concern us further. For illustrative purposes we choose the value I’ = 18.2 
dB. The reasonableness of this value in terms of speech quality is dis- 
cussed in Appendix A. Figure 4 shows D versus U, — S for two receivers 
n = 47.7 and n = 13.4 out to U; — S = 18.2. The steep rise of these curves 
and the absence of any extensive straight portions show that D is not 
dominated by the second harmonic but involves a large number of har- 
monics. At several points the SPL at the fundamental is indicated to show 
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D (S+18.2) 


DISTORTION D IN DECIBELS 





0 5 10 15 20 25 
SENSITIVITY LEVEL S IN DECIBELS 


Fig. 6—Total harmonic distortion D (S + 18.2) at the channel clipping level for the case 
I = 18.2 dB versus sensitivity level S defined in (56). Dashed curve shows D(C) at receiver 
clipping level C defined in (61). 


that an 18-dB rise in level does not produce a corresponding rise in SPL 
when D is high. For a distortion-free receiver, (46) becomes with the help 
of (52), (53), and (56) 


SPL = 81+U,—-—S (distortion—free receiver). (59) 


The normalized waveform (20) is shown in Fig. 5 for two cases iden- 
tified as points (Z), (iz) in Fig. 4. The distortion of the waveform is shown 
to be a gradual clipping of the positive peak. In the clipping model this 
is approximated by abrupt clipping out to 6 = 7 as indicated for curve 
(ii). We have obtained good results in calculating Do, D3 and D by the 
clipping model when D > —40 dB; higher harmonics are given with di- 
minishing accuracy. 

Figure 6 shows the total distortion at the channel clipping level 
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/ 


CLIPPING FACTOR ® IN DECIBELS 





15 20 25 30 35 40 45 46 
LEVEL U, IN DECIBELS 


Fig. 7—Clipping factor (21) versus level (53) for two illustrative values of n. Dashed lines 
represent abrupt clipping approximation (61) which defines the clipping level C of the 
receiver. The clipping discount A (see Appendix A) is chosen as A = 6 dB. 


D(S + 18.2) versus S for a receiver at the reference point. Conceivably 
an optimization could be based on an upper-limit objective for D at the 
channel clipping level. The dashed curve shows D(C) at the receiver 
clipping level (to be defined in the next section). Notice that D(C) is 
approximately constant, so in the present instance the quality matching 
principle is in effect equivalent to requiring D < — —32 dB at the clipping 
level. 


VI. RECEIVER CLIPPING LEVEL 


The clipping factor has been defined in (21) for the general analysis 
and in (81) for the clipping model. The validity of the clipping model has 
been confirmed from the waveform and from calculations of D. Using 
the clipping model, we have calculated &(U}) for various receivers n. (For 
brevity, n is not explicitly indicated in writing ©.) Figure 7 shows the 
results for n = 13.4 and n = 6.5. It is known! that telephone speech 
quality, as determined in subjective listening tests, is not degraded ap- 
preciably by small clipping, & < A, where we call A the clipping discount 
and adopt the value 


A =64dB. (60) 


(This value has been deduced by us from an examination of the un- 
published work of A. M. Noll.!4) The discount is shown as a dashed line 
in Fig. 7. At each intersection of the discount with & we draw a line of 
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unit slope as shown. This represents abrupt clipping 


0 U,<C 
@(U,;) > 61 
Ee. A. Se on 
at the receiver clipping level C. Thus C (as a function of n) is defined 
by the relation 


(C+ A)=A. (62) 


Notice that C is not very sensitive to the value chosen for A; any value 
of A in the range 2 to 8 dB would give about the same C(+ 1 dB). 

If the concept of a clipping level is valid, the actual (Uj) for the re- 
ceiver can be replaced with the abrupt clipping approximation (61). A 
test of the validity of the concept is the distortion D at U; = C; for abrupt 
clipping D would be zero up to U; = C. Figure 6 shows D(C) versus S; 
it is approximately constant around —35 dB. The departure of ®(U;) 
from abrupt clipping above A is not of great significance, because the 
important question in quality determination is whether degradation 
occurs, not how much degradation has occurred. 


Vil. QUALITY MATCHING 


The optimum sensitivity is the smallest value consistent with the 
requirement that the quality of the channel not be degraded by the re- 
ceiver. This gives the principle of quality matching expressed by the 
relation 


C=U, (quality matching), (63) 


that is, the equality of receiver and channel clipping levels. This matching 
is to hold at all points x in the optical loop system. 

Figure 8 shows a schematic diagram of levels in a system versus the 
distance x from the optical source at the electrical-optical interface. We 
define the power transmission loss of the lightguide in the usual way 


TL(x) = 10 log [w(0)/u(x)] (dB). (64) 
At x = 0 the interface injects the optical power 
u(0) = h, + hycos 6 (65) 
at the levels 
Ho = 20 log (ho/x) = U,(0) 
Hy = 20 log (h1/x) = U(0). (66) 
It follows that 


U,(x) = Ho — 2TL(x) 
U(x) = H, — 2TL(x). (67) 
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TRANSMIT LEVEL IN DECIBELS 
RECEIVE LEVEL RL IN DECIBELS 





DISTANCE x 


Fig. 8—Levels versus distance in a quality matched lightguide loop system (receiver 
portion only). The transmit levels (referring to the optical source) are the channel clipping 
level U,, mean amplitude level (U1), and transmission loss TL defined in (55), (54), and 
(64) respectively. Receiver sensitivity level $ is also shown on the transmit scale. Receive 
level (68) is also shown. Reference point x is defined in (57). The optical source is charac- 
terized by H,; (H1)n is defined in (66). 


In Fig. 8, TL, Up, and (U1) yn are called transmit levels and are referred 
to the scale on the left. Also shown referred to the left scale is the receiver 
sensitivity level S. The receive level RL(x) defined by 


RL(x) = (SPL(x)) (68) 


is shown referred to the scale on the right. The reference point defined 
by (57) is denoted by x. The dynamic range I defined by (54) is a con- 
stant of the system. 

From (54), (57), and (63) we obtain the quality matching relations 


C(x) -S(x) =T (69) 

C(x) — C(x) = 2A(x), (70) 
where 

A(x) = TL(x) — TL(X). (71) 


Figure 9 shows C, C — S, and S$ plotted versus n. The implementation 
of quality matching is illustrated for the case T = 18.2 dB. We find point 
(i) on the C — S curve according to (69), which determines point (iz) on 
S and (iii) on C. The optical power at the source is then determined by 
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LEVEL IN DECIBELS 





1 2 4 6 8 10 20 40 60 100 
TURNS RATIO n 


Pig. 9—Receiver clipping level C defined in (61) versus n. Also shown are C — S, S, and 
S, where S is the sensitivity level (56) and S is the sensitivity level for the distortion-free 
receiver defined by (52). Points (i) to (v) trace the quality matching procedure based on 
(54), (63), (69), (70), and (74). 


(iit) through the relation 
H, = C(x) + 2 TL (72) 


in terms of TL = TL(x). The choice of TL determines the average RL of 
the system, which is a system objective (not necessarily 81 dB) we must 
leave open at this time. We find point (iv) on C according to (70), which 
determines the optimum value of n for a receiver at x. It also determines 
point (v) on S. (The intersection with S, when it differs from S, is not 
of interest.) From (22) and (uv) we determine the quantity 


P(x) = S(x) — S(x). (73) 


From (59), (67), and (68) the receive level for a distortion-free receiver 
is 


RL(x) = 81 + P(x) — 2A(x). (74) 


This is an excellent approximation for RL, because quality matching 
guarantees that distortion is too small to have any effect on response. 
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In Fig. 9 the lines C and S are given by the relations 


C = 70.5 — 44.3 log n (75) 
S = 32.6 — 20 log n. (76) 
If > 5 dB, it is justified to approximate 
C — S = 37.9 — 24.3 log n. (77) 
It follows that 
C(x) = 14+1.82T (78) 
P(x) = 0.90 A(x) (79) 
log n = 1.56 — 0.041 T + 0.045 A. (80) 
Thus the receive level is 
RL* RE = 81-1.1A. (81) 


and the clipping level at the optical source is 


Hy, =144+1.820 + 2Tr. (82) 


Vill. SUMMARY AND DISCUSSION 


We have obtained a practical algorithm suitable for a large computer 
for solving the nonlinear integral equation (3) referring to Fig. 1. The 
equation implies an integral restraint (4) on the solution which is an 
unusual feature that removes it from the types found discussed in texts 
on nonlinear networks. The load r is taken as zero at de and a constant 
resistance at all signal frequencies of interest. This implies that the load 
is dispersive at frequencies below a certain cutoff frequency (800 Hz in 
the receiver problem). Ordinarily a nonlinear dispersive circuit requires 
nonlinear differential equations to describe it. Here we have avoided the 
differential equation and obtained instead an integral equation (3) by: 
(t) asking only for the periodic solution, and (ii) treating separately the 
ac and dc voltages with different values of r. The assumption of zero de 
resistance is convenient and usually appropriate in practice, but the 
analysis presented in Section II can easily be generalized to any value 
of dc resistance. The nonlinear conductor j(v) is passive (j(0) = 0) and 
monotonically increasing (j(v)’ = 0) but otherwise arbitrary. The con- 
ditions on j(v) rule out negative-resistance instabilities and guarantee 
a unique solution. 

The method presented in Section II is exact in principle; it is based 
on the fact that any functional of the solution (e.g., a Fourier coefficient) 
can be calculated explicitly by integration once two parameters (a,b) 
have been determined from the implicit relations (12), (13). Standard 
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routines are available for solving simultaneous implicit relations to any 
desired precision. In using this method we have usually obtained satis- 
factory convergence with no special precautions. When convergence 
problems are encountered, the answer is to start the iteration with better 
estimates for a,b. 

The clipping model described in Section III was originally worked out 
to provide initial estimates of a,b for use in the exact method. It soon 
became apparent, however, that it is sufficiently accurate in the receiver 
problem for all calculations of sensitivity, total distortion (when D > —40 
dB), and clipping factor. The reason for this is that the exact waveform 
comes close to the clipped waveform assumed in the model. The clipping 
model is not recommended for the calculation of specific harmonics 
higher than the second. Generally the clipping model is expected to be 
useful whenever j(v) is a rapidly increasing function. In this model dis- 
tortion (clipping) is represented by a single parameter 7 which is de- 
termined from the implicit relation (40). No numerical integrations are 
involved in using the model, which makes it convenient for use with a 
minicomputer. 

The analysis of the photovoltaic receiver in the remainder of the paper 
is based on a sinusoidal input waveform. The calculation of the sensi- 
tivity s, defined in (49) as a measure of response based upon producing 
a certain reference sound level at the fundamental of 81 dB, is presented 
in Section IV. The photodiode and the earphone assumed in this cal- 
culation are the best presently available for the purpose. The transformer 
secondary must have an inductance of at least 70 mH, so the turns ratio 
n is adjusted by means of taps on the primary. By adjusting n, any value 
of s down to the minimum 2.6 »W can be obtained. However, the size and 
cost of the transformer are expected to increase approximately as the 
maximum value of n. 

The minimum s = x shown in Fig. 3 is a nonlinear effect having 
nothing to do with the impedance matching concept of linear circuit 
theory. The small signal resistance of the junction is given in Table I, 
(a@B)—! = 8.3 X 109 Q which is larger than r(x) = 2.9 X 10° Q by a factor 
= 3 X 104! This shows the necessity for a nonlinear analysis. For n > 48 
the distortion increases very rapidly at the reference level assumed for 
the calculation of s. This does not mean, however, that n > 48 is ruled 
out for receivers operating at much lower levels. In a properly designed 
system the receiver must respond almost like the distortion-free receiver 
defined in (52) at the levels to which it is subjected. The reference level 
SPL = 81 dB is the overall average level for receivers in the existing 
telephone system. 

At a particular point, the reference point, in a loop system the receive 
level defined in (68) will equal 81 dB, and at other points its value will 
depend on the transmission loss from the reference point to that point. 
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For a discussion of total harmonic distortion D we have considered a 
receiver at the reference point in Section V. We find (Fig. 4) that D rises 
very rapidly with level up to about D ~ —20 dB and then begins to bend 
over. The value of D of interest is the value at the channel clipping level 
shown versus sensitivity in Fig. 6 for an assumed dynamic range of 18.2 
dB. The results confirm that D is much too high in the minimum sensi- 
tivity receiver for use at the reference point. It is conceivable that some 
criterion on D (e.g., D < —20 dB) could be used as the basis for optimizing 
the receiver (choosing n). However, we believe that clipping provides 
a more objective and less arbitrary basis for optimization. The existence 
of clipping is shown by the waveform of Fig. 5 and by the good agreement 
between the clipping model and the exact method of D. 

The clipping factor defined in (21) has been calculated as a function 
of level in Section VI. At a certain value A, called here the clipping dis- 
count, abrupt clipping begins to degrade speech quality.!4 Therefore an 
abrupt clipping approximation has been fitted to the receiver clipping 
factor at the value A. This defines the clipping level C as illustrated in 
Fig. 7 for the choice A = 6 dB. Actually C is not very sensitive to A. The 
total distortion at level C would be zero for abrupt clipping. We find D(C) 
~ —36 dB for all receivers (Fig. 6); in our opinion this is small enough to 
confirm the validity of the clipping level concept. 

Quality matching as an optimizing principle is defined in Section VII. 
It amounts to setting the channel and receiver clipping levels equal as 
in (63). The quality of speech transmitted in an analog channel in which 
the only nonlinearity is abrupt clipping is determined by the dynamic 
range, defined for sinusoidal signals in (54). The dynamic range I is in- 
dependent of lightguide transmission loss so we consider it a constant 
of the optical loop system. In Fig. 8 we pass from a strictly device view- 
point to a system viewpoint. The quality matching relations (69) de- 
termine the optical power needed at the reference point as well as the 
optimized receiver and receive level throughout the optical loop system. 
The various levels are shown schematically in Fig. 8 as a function of 
distance x from the optical source assuming the same source power for 
all loops. The curves required to obtain the solution are shown in Fig. 
9, and approximate equations for the solution are given in (78) through 
(82). 

In the existing system the mean loop insertion loss!* is about 5 dB. The 
receive level defined in (68) is 81 dB at a point 5 dB from the central 
office, and 81 dB is approximately the mean loop receive level. This is 
not necessarily an objective for future loop planning, so we here em- 
phasize that our analysis contains no such assumption. The use of 81 dB 
as a reference is only a convenience and involves no loss of generality. 
The reference point x in Fig. 8 with the transmission loss TL need not 
be the “average point” for the loop system. The mean loop receive level 
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is from (81) 
(RL); = 814+ 1.1 (TL — (TL)z) (83) 


where ( )7 denotes an average over loops. This can be set at any desired 
level by properly choosing TL. 

The quality matching concept requires a receiver with adjustable turns 
ratio which probably involves some added cost compared with a fixed 
receiver. However, any fixed receiver would give a receive level varying 
as —2A, 


RL = 81 —-—2A (fixed receiver), (84) 


whereas quality matching gives (81) varying approximately as —A. This 
effect is shown in Fig. 8 by the different slopes of RL and (U1) ny. Clearly 
this is a desirable effect which utilizes the capabilities of the receiver to 
the fullest extent and permits serving a radius approximately twice that 
which would be possible with a fixed-sensitivity receiver having the 
sensitivity of an optimized receiver at the reference point. The actual 
range of the loop system would probably be limited by objectives on the 
minimum RL which we are not discussing here. Another limitation which 
we can only mention is that of transformer cost. From (80) we find that 
the cost, as measured by n, doubles for an increase of 6.7 dB in TL. 

To illustrate the principles being discussed, we have chosen a realistic 
case specified by 


Il =18.2dB 
TL=5dB (illustrative) 
A(x) = 7.5 dB. (85) 


Points (7) to (v) in Fig. 9 trace the solution for this case. At the reference 
point x we find 


C(x) = U,(x) = 34.6 dB 
Uo(x) = 0.14 mW 


s(x) = 17 uW. (86) 
At the point x we find 
C(x) = 19.6 dB 
n(x) = 14.1 
RL = 72.8 dB. (87) 
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At the source we find 


H, = 44.6 dB 
(H1)n = 26.4 dB 
h, = 0.45 mW 
peak source power = 2h, = 0.9 mW. (88) 


The fact that the peak source power is 2 h, follows from our assumptions 
of equality of top and bottom side clipping and of occurrence of bottom 
side clipping at h = 0. 

The sensitivity at the reference level s(x) = 17 nu W is 50 times smaller 
(better) than that of the best optoacoustic receiver, the xenon photo- 
phone? (s = 0.9 mW). Furthermore, the photophone is a fixed receiver. 
Therefore the photovoltaic receiver is clearly superior for loop applica- 
tions. 

The “system” referred to here should be regarded as a relatively small 
subsystem of the loop plant serving a special class of customers who have 
lightguides running to their premises primarily to provide high capacity 
services. At the central office or other junction point there must be an 
electrical-optical interface containing the optical source for the receiver. 
The peak power of this source according to (88) should be 0.9 mW. This 
power into the lightguide is within the capabilities of present day het- 
erojunction laser diodes!® but about an order of magnitude above the 
capabilities of luminescent diodes.!® To a limited extent the power at 
the source might be varied to compensate for the transmission loss; one 
could imagine all adjustment being done at the source instead of the 
receiver. Although we have described a system with fixed source power 
in Fig. 8 and believe that this is the most likely type of system, no change 
is required in the equations to treat the variable source. The dynamic 
range, however, would be determined by the interface circuitry and held 
to a uniform value to maintain transmission quality. 

It may be objected that our nonlinear analysis has been based on a 
sinusoidal input signal whereas a telephone is required to transmit 
speech. Appendix A contains a discussion of the extension of the theory 
to a general waveform and the general definition of dynamic range. It 
is argued that the clipping level is valid for any waveform. A quality Q 
is objectively defined on the basis of the approximately normal distri- 
bution of speech levels in the telephone system. Figure 10 shows Q versus 
I’ for speech and for a hypothetical system having sinusoidal signals. 
Kssentially, Q is a probability that speech is received without degradation 
caused by clipping. The illustrative case I = 18.2 dB used in this paper 
corresponds to Q = 86 percent and Q,o; = 99 percent. An objectively 
defined quality is not equivalent to a grade of service determined sub- 
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Fig. 10—Objective speech quality Q defined in (104) versus dynamic range I’ defined 
in (93) plotted on probability scale. The corresponding quantity for sinusoidal waveform 
is shown dashed. 


jectively from listening tests, although we would expect a strong corre- 
lation between the two. 

We shall not speculate on the circuitry of the interface, but we offer 
an interesting observation in Appendix B. On the basis of the light- 
emitting characteristic of laser diodes and the signals from the metallic 
network existing at the central office, it is shown that the direct appli- 
cation of the metallic network signal current (no amplifiers or trans- 
formers) to the laser would produce a somewhat greater level than (H1) 
called for in (88). 

Let us suppose that TL = 5 dB in the illustrative case (85) is the av- 
erage loss and the lightguide attenuation constant!” is 2 dB/km. Then 
the reference point, which is also the average point, is at the distance 


xX = 2.5 km = 8.8 kft (illustrative). (89) 


This distance may be compared to the average loop length!® 10.3 kft in 
the present metallic loop plant. A more conservative estimate of atten- 
uation, like 6 dB/km, reduces (89) by a factor of three. Thus, the optical 
loop system seems to be limited to a somewhat smaller radius of service 
than the metallic loop system. 
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APPENDIX A 
Speech Quality and Dynamic Range 


We continue to let 6 denote a normalized time variable, but in (43) and 
(65) we replace cos 6 with a more general normalized waveform Y/(@) 


max ¥(@) = —min y¥(@) = 1 
(¥(8))o = 0. (90) 
Define the peak factor 
W = —20 log Wms (91) 
and signal level 
U = 20 log (ui¥rms/Xx) 
=U,-W (92) 
where U, is given by (53). Define the dynamic range 
T= (20 log (up/2¥/?uiypms)) N 
=U,—-—(U)n-3 (93) 
where the channel clipping level U, is given by (55) and where 


(Ue f " U(N)AN, (94) 


N being a distribution function. For ¥(0) = cos 6 we have Y = 3dB and 
(93) reduces to the definition first given in (54). For the distortion-free 
receiver the SPI is 


Spr = 81+U—-S+3, (95) 
which reduces to (59) when VW = 3. We assume (61) and (63) remain valid 
but write (61) in terms of U 


0 U<U,-wW 
UY =U, Le Ug Se: 


The levels U are distributed’!!!° such that the probability that 
U<X is 


&(U) = | (96) 


P(U < X) = N[(X — XQ)/o] (97) 
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where 
Xo = (U)nN, o = 7.8dB (98) 
and N(x) is the normal distribution | 


N(x) = (2m)? ff eo t/2 dt. (99) 


co 


It is known from subjective studies!‘ of the effect of clipping on telephone 
speech quality that no degradation occurs for  < A, where we take 


A = 6 GB. (100) 


(This is a deduction from the work of A. M. Noll for which we assume 
responsibility.) The probability that @ < A is 


P(@ < A) = N[(U, + A- YW - (U)y)/o] 


=N[(T+A+3-W/ol. (101) 
The speech waveform is characterized by the value!® 
VY =18.6dB (speech), (102) 
so that 
A+3-—-wW=-9.6dB (speech). (103) 


The objective definition of quality is 
Q = 100 X P(@ < A) (speech) 
= 100 X N[(T — 9.6)/7.8]. (104) 


A corresponding definition can be given for sinusoidal signals having the 
same level distribution assuming A = 0 and V = 3 


Qcos = 100 X N(T/7.8). (105) 
Figure 10 shows Q and Q,o; versus I on a probability scale. 
APPENDIX B 
Comments on the Optical Source 


A laser diode has a threshold current for lasing and at higher currents 
a steeply rising emission h as a function of current 1. In the lasing region 
we assume!? 


dh/di = « 
€= 2.3 W/A. (106) 


If a sinusoidal current of amplitude 7; flows through the laser (super- 
posed on a suitable bias current), the optical signal level H, defined in 
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(66) is 
Hy, = 20 log (e1/x). (107) 


Suppose that the current 1; is that supplied to a matched load by a 
generator of resistance r and available power p; then (107) can be writ- 
ten 


Hy = 20 log [21/2(€/x)(p/r)¥/7]. (108) 


Finally, suppose that the generator is the central office and p corre- 
sponds to the mean signal level; representative values are?° 


p=2uW, r= 11660. (109) 
It then follows that the mean optical amplitude level at the source is 
(Hi)N = 34dB (hy = 0.13 mW). (110) 


This compares very favorably with the level called for, 26 dB, in (88). 
This shows that the current flowing in a matching resistor (1166 0) 
combined with the “gain” of the laser emission characteristic provides 
more than enough signal without a matching transformer in the cir- 
cuit. 

If a matching transformer is used, r in (108) is replaced by the dynamic 
laser diode resistance!®?! rg ~ 1.5 Q, giving h; ~ 3.8 mW. For a lumi- 
nescent diode an appropriate value of ¢€ in (106) is!®« ~ 5 x 10-4 W/A 
giving for the transformer matched case with rg = 1.5 2 the power h ~ 
0.8 uwW, (Hi)n = —10 dB. 
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An experiment has been performed to study the perceptual char- 
acteristics of speech processed by adaptive differential PCM. We created 
18 three-bit and four-bit coders spanning a wide range of quantizer 
adaptation parameters. Subjects judged differences between coders 
and rated the quality of each coder individually. The difference data 
reveal three important perceptual characteristics: overall clarity, signal 
vs. background degradation, and rough vs. smooth impairment. These 
characteristics are strongly correlated with coder design parameters 
and objective performance measures. Ouerall subjective quality is well 
predicted by segmental signal-to-noise ratio and even better by a linear 
combination of measures of granular distortion and overload distor- 
tion. 


1. INTRODUCTION 


Speech signal processing systems are susceptible to a variety of audible 
impairments often classified with words like “distortion,” “noise,” 
“echo,” and “‘sidetone.” These categories are themselves subdivided: 
for example, “linear” and “nonlinear” distortion, “white” noise, “im- 
pulsive” noise, ‘“speech-dependent” noise, etc. When the type of system 
is familiar to a large body of listeners, the application of these names 
becomes standardized and a language exists for describing the quality 
of specific implementations. With new systems, however, the types of 
degradation are often not known a priori, and special effort is required 
to identify them and to relate them to physical characteristics of the 
system. 

For example, experiments on PCM (pulse code modulation) have 
identified peak clipping, granular quantizing noise, and bandlimiting 
as important audible degradations.!* In PCM, there are relatively few 
design parameters, and each of these impairments can be related to one 
of them: peak clipping to quantizer overload point, granular noise to step 
size, and bandlimiting to sampling rate. 
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In ADPCM (adaptive differential PCM), a coding method that appears 
promising for a number of practical applications, the situation is more 
complicated. Here each design parameter has interrelated effects on 
several types of degradation, and the perceptual correlates of a particular 
design are hard to predict. In ADPCM, the step size and overload point 
vary with time, producing a dynamic mixture of overload and granularity 
that depends on the adaptation mechanism. One can identify in the 
quantized waveform two types of overload: overload that causes clipping 
of stationary inputs and in addition, overload due to slow quantizer re- 
sponse to increases in short term (syllabic) signal level. Moreover, a 
mathematical study? has identified two separate aspects of adaptive 
quantizer performance: static (response to constant-level inputs) and 
dynamic (response to changes in input level). However, it is by no means 
evident or even likely a priori that these mathematically separable 
characteristics are perceived separately. 

To investigate perceptual characteristics of speech processed by 
ADPCM we conducted an experiment that is summarized in the next 
section. The following five sections provide details of the coding method, 
objective performance measures, the experimental design, and analyses 
of subjective and objective measurement data. Section VIII discusses 
the implications of the principal findings. 


ll. SUMMARY 


High-quality digital recordings of speech samples from four talkers 
(two male and two female) were processed according to 18 different 
ADPCM coding schemes on a digital computer. The coders incorporate 
all combinations of two bit rates, three load constants, and three time 
constants. The data obtained from the experiment consisted of two types: 
objective measurements and subjective judgments of the processed 
speech. With these two types of data we addressed the following ques- 
tions: 


(t) What are the perceived characteristics of speech processed by 
ADPCM? 

(it) How are these characteristics related to subjective judgments 
of quality? 

(tii) What is the relationship between objective performance measures 
and the perceptual features? 

(iv) What is the relationship between objective performance mea- 
sures and judgments of circuit quality? 

(v) What combinations of design parameters produce coders within 
a given quality range? 


The analyses of the data indicate the following answers to each of the 
above questions: 
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(i) Listeners perceive three distinct characteristics of the processed 
speech: (a) the overall clarity, (b) the kind of degradation that reduces 
the clarity, namely, whether the degradation is signal distortion or noise 
or both, and (c) the nature of the signal distortion and/or the nature of 
the noise. 

(iz) Quality judgments are correlated with all of these subjective 
variables. The overall clarity is by far the strongest correlate. 

(vit) Signal-to-noise ratio, measured segmentally, SNRseg, is a good 
predictor of the overall clarity. The log of the ratio of attack to recovery 
speed, log A/R, is a good predictor of the mixture of signal distortion and 
background noise. The log of the attack time, log T,, predicts the kind 
of signal distortion and/or the kind of noise. 

(tv) SNReeg is a very good predictor of quality judgments, while SNR 
measured in the traditional manner is a very poor predictor of quality. 
A linear combination of probability of overload, P, and segmental sig- 
nal-to-granular-noise ratio, SNRGseg is an even better predictor of quality 
than SNRgeg. 

(v) By applying the prediction equations to coders with design values 
intermediate to those of the experiment, we show the combinations of 
load constant and time constant at each bit rate that would be judged 
about equal in quality. Those that would be rated almost as highly as 
the best coder cover a wide range of design parameters. 


ll. CODER DEFINITIONS 


Figure 1 is a block diagram of an ADPCM coder-decoder. In the absence 
of transmission errors, the sequence of received samples r’(k), is identical 
to the quantized approximation sequence r(k). In our experiment s(k) 
was a digital speech signal represented in a 12 bit, 8 kHz format and the 
coders were realized in software on a Data General Eclipse computer. 

The conversion from 12-bit PCM to 3-bit or 4-bit ADPCM is performed 
according to the algorithm described by Castellino et al.4 In all of the 
coders the predictor is a two tap transversal filter with coefficients 1 and 
—0.5 so that the relationship of approximation signal, r(k), to quantizer 
output, d(k), is 

—r(k) = d(k) + r(k — 1) — 0.5r(R — 2). (1) 

Signal-level estimation. The step size, A(X), which is derived from 
the sequence of quantized prediction error samples d(k) or equivalently 
from the transmitted code words, I(k), is proportional to an estimate 
of the mean absolute value of the quantizer input, e(k). The estimate 


at time k, o(k), is an exponentially weighted sum of quantizer output 
magnitudes. It is computed recursively as 


o(k) =ac(k-1)+(1L-aldk-D|. (2) 


SPEECH PROCESSED BY ADAPTIVE DIFFERENTIAL PCM 1599 





INPUT s(k) 







K(k} 


DECODER 


CODER 
STEP-SIZE 
F LOGIC a 









Stk) _ | srepsize Hi 
PREDICTOR siz 
P LOGIC a 
(+) DECODER 
OUTPUT 


Fig. 1—Block diagram of ADPCM coder-decoder. 


Here, the parameter a (0 < a < 1) determines the speed of response of 
the quantizer to changes in input level. A low value of a provides a fast 
response; a ~ 1 is associated with a slow response. A compromise between 
static and dynamic performance objectives is required in selecting a. A 
high value (a ~ 1) provides more accurate “steady-state” tracking of a 
constant signal level than does a low value. 

If the signal level suddenly increases, the estimate o increases with 
an initial slope proportional to 1 — a volts/sample (if « is measured in 
volts). In this paper we shall refer to an adaptation time constant, 7 sec, 
that is the reciprocal of this initial slope in response to a unit step. It is 
defined by 





C= sec 


where T is the sample period. For 8 kHz sampling 


0.125 
T= msec. 
l-a 


To incorporate a perceptibly wide range of signal conditions, we have 
selected, after informally listening to a large number of coders, 3 values 
of a for the experiment. They are a = 1/, 34/49, 2556 with corresponding 
time constants: 7 = 0.25, 4, 32 msec. 

Quantizer loading. The quantization step-size A(R) is proportional 
to the signal-level estimate, o(k): 


A(k) = Ca(k) (3) 
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where the load constant, C, determines in steady-state (fixed signal level) - 
the mixture of granular noise and overload distortion in d(k). A relatively 
high value of C produces a large average step size and causes granularity 
to be the principal distortion component. With a very low value of C, 
overload predominates. 

For a given number B bits/sample, we define a nominal load constant, 
Co. Co is the step size which produces minimum mean square error in 
a fixed quantizer processing a signal that has Laplacian probability 
density with unity average magnitude. In fact, it has been noted that the 
shape of the PDF of the compressed signal e(k)/o(k) is between a 
Gaussian and a Laplacian function, and more near to the Laplacian one 
for higher time constants.° For B = 2, 3, 4, 5 bits, Co = 1.53, 1.08, 0.65, 
0.40, respectively. We define a relative load constant for each quantizer 
to be 


L = 20 logio © dB. 
Co 


After listening informally to speech processed by a variety of coders, we 
chose for the experiment three relative load constants, L = —10, —4, 4 
dB. 

It has been shown® that this adaptive quantizer is a special case of the 
one with multiplicative step size changes’: 


A(k + 1) = MII(k)JA(R). 


The multipliers associated with code words J(k) = +1, £2,... +2871! 
are 


M[I(k)] = a + (1 ~ a)C((I(R)| - 0.5). 


Dynamic and static behavior. The dynamic behavior of the coder can 
be described by two characteristics: the attack and recovery speeds. The 
attack speed is defined as the step size increase (in dB) per unit time 
when the signal level suddenly changes from a very low value to a very 
high value. The recovery speed is defined as the step size decrease (in 
dB) per unit time when the signal level suddenly falls. The attack and 
recovery speeds can be computed from the largest and smallest multi- 
pliers?: 


9 B 

Mar 1 ee 

T sec 

-20 
T 


where T is the sampling time, and n = 22-1. A small attack speed will 
produce slope overload distortion, while a small recovery speed will result 


Vy = 


logM (I) = 
sec 
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in greater granular distortion. The static behavior is also affected by 
attack and recovery speed: a high attack and low recovery speed result 
in a step size that is higher on the average than that resulting from a slow 
attack and fast recovery. A very good indicator of static performance is 
the attack to recovery ratio: 


Yo logM(I,,) 
A= = — 
/ vr, —logM (1) 


Attack time, which we have found to be strongly correlated with the type 
of distortion or the type of noise produced by a coder, is the reciprocal 
of attack speed: 


Summary of conditions. The experiment includes coders with 3 
variable design parameters: B bits/sample, 7 msec response time, and 
L dB relative load constant. The 18 coders comprise all combinations 
of B = 8, 4; 7 = 0.25, 4, 32; L = —10, —4, 4. 


IV. OBJECTIVE MEASURES 


Our aims include exploration of the relationships between perceived 
characteristics of the processed speech and objectively measurable 
quantities. To investigate these relationships, we have computed several 
objective performance indices for each processed utterance. The mea- 
sures are defined as follows: 

Total signal-to-noise ratio. 


Ys2(k) 
D[s(k) — r(k)]2 
Here k ranges over all samples in the utterance, and r(k) is defined as 


the best estimate of s(k). 
Granular signal-to-noise ratio. 


SNR = 10 log 


Ys2(k) 


SNRG = 10 lo ER 
E SIse(k) — re(k)P? 


where k& ranges over all samples in the utterance and the signals s,(k) 
and r,(k) are defined only when the quantizer is not overloaded; that 
is, when the quantization error is less than one-half the step size: 


sg(k) = s(k); rg(k) = r(k) if |s(k) —r(h)| < 
se(k)=r(k)=0 if |s(k)—r(k)| > i 
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Percent of samples overloaded. 


N 
dX sg(k)/s(R) 


k=1 
P=100)1- 
N 


where WN is the total number of samples in the utterance. 
Total segmental signal-to-noise ratio. This is a measure proposed 
by Noll® as a more relevant index of speech quality than SNR: 


y s?Vj + 128m) 


12 
j=l 


1 M-1 
SNReeg = u 10 log 


m=0 


> [sG + 128m) —r(Uj + 128m)]? 
j=l 


Here the utterance is divided into segments each containing 128 samples 
(16 msec) and the signal-to-noise ratio in each segment is measured in 
dB. The average of these measures over the M segments in the utterance 
is SNReeg. 

Granular segmental signal-to-noise ratio. 


128 
> s? G + 128m) 
1 Mz=1 j= 
SNRGseg = M du, 10 log 

m= 


> [sg + 128m) — rg(j + 128m)]? 
j=l 


Here the same procedure as for SNRseg is applied to samples s,(k) and 
rg(k). 


V. TESTING PROCEDURE 


Digital recordings* of 10 sentences spoken by each of 4 talkers (2 male 
and 2 female) were processed by each of the 18 coders. The processed 
sentences were equalized to the same mean power to eliminate level 
differences due to quantizer overloading and thereby minimize differ- 
ences in subjective loudness. Four analog test tapes, each containing the 
153 possible pairs of coders, were prepared from these recordings. The 
speech samples in a pair of conditions were the same sentence by the 
same talker. The talkers and sentences were assigned to coder pairs so 
that they occurred as equally as possible on a tape. A pair of coders 
processed a different talker and sentence on each tape and the order of 


* The source speech was the set of digital tape recordings used in a previous experiment 
on PCM. 
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presentation within the pair was reversed on half of the tapes. The coder 
pairs appeared in a different (random) order on each tape. 

Students from the junior and senior classes of local high schools served 
as paid subjects. They listened to the processed speech over Pioneer 
SE700 earphones at 80 dBSPL while seated in a double-walled sound 
booth with frequency-weighted room noise introduced at a level of 50 
dBA. 

Dissimilarity judgments. In the first experiment, 17 subjects (3,4,5,5 
per random order) judged the pairs of conditions. They were told to use 
the numbers from 0 to 9 to indicate how different the speech sounded 
over each pair of coders, using a 0 for no difference, a 9 for very different, 
and the numbers between 0 and 9 for intermediate differences. Before 
the test session began they judged 6 pairs, for practice, that represented 
the expected range of differences. 

Preference judgments. In the second experiment, 16 different subjects 
(4,5,4,3 per random order), also junior and senior high-school students, 
listened to the same tapes containing the pairs of coders. However, this 
time the subjects were instructed to indicate which condition of each pair 
they would find more acceptable for listening to speech. 

Rating judgments. In the third experiment, subjects judged the 
quality of the coders individually. Eight audio tapes were prepared, each 
containing 36 sentences. Tapes 1-4 had the processed speech (played 
through the 18 coders) of one male and one female talker. The other two 
talkers appeared on tapes 5-8. The stimuli on tapes 1-4 appeared in 
different (randomized) orders. The same 4 orderings were used for tapes 
5-8. The sentences occurred as equally as possible on each tape. 

In this experiment the subjects were asked to rate the quality of the 
36 conditions according to the adjectives: excellent, good, fair, poor, 
unsatisfactory. Their answer sheets contained 36 rows of short lines 
separated into 9 columns. The odd columns were labeled with the ad- 
jectives and the even ones unlabeled, allowing the subjects to check in- 
termediate ratings if they chose to do so. On half the answer sheets the 
order of the labels were reversed. Tapes 1-4 were presented to the 17 
listeners of the first experiment in a short session that took place 5 
minutes after the completion of the difference judgments. Tapes 5-8 
were presented to the 16 listeners of the second experiment 5 minutes 
after the completion of the preference judgments. 


Vi. INITIAL DATA REDUCTION 


The experiment was performed to provide information about rela- 
tionships between ADPCM coders and it is expected that for the most part 
differences in listener responses are due to coder differences. The ex- 
periment was designed to cause other sources of variability to be mu- 
tually cancelling in the average data for each coder. Sources of extraneous 
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variability are: differences in the way listeners use the response scales, 
differences in speech material, and effects of presentation order. Before 
aggregating the data for individual coders it was necessary to assess the 
importance of each of them. 

Difference judgments. The variability due to listener differences is 
revealed by the correlation coefficients of pairs of subjects who heard 
the same tape. These coefficients have a mean value of 0.61 and standard 
deviation of 0.09, indicating substantial agreement. The effects of pre- 
sentation order and talker were tested by means of an analysis of variance 
which showed that the responses to the 4 random orders were not sig- 
nificantly different. The variability due to the different talkers was 
significant at the 0.05 level, but accounted for only 1 percent of the total 
variance. This variability is due to the fact that speech of female talkers 
was rated differently from speech of male talkers. There was no signifi- 
cant difference in the ratings of talkers of the same sex. 

The important variability in the difference data can therefore be at- 
tributed to coder differences, and to assess these differences, we nor- 
malized the 153 responses of each subject to zero mean and unity stan- 
dard deviation. The averages, across the 17 subjects, of the normalized 
responses were the elements of a dissimiliarity matrix which was ana- 
lyzed according to the MDSCAL®~!8 procedure. 

MDSCAL locates points, representing the stimuli, in a multidi- 
mensional space so that the distances between the points are monoto- 
nically related to the judged differences. Because the dimensionality of 
a solution is specified as input, successive solutions of increasing di- 
mensionality are usually computed. Then, the stress values (essentially 
the root mean square error) and the interpretability of each solution are 
used as criteria for deciding upon the smallest number of dimensions 
that are needed to explain the data. The stress values give a measure of 
how well the distances in the solution spaces correspond to the reported 
differences among the coders. Solutions in 1, 2, and 3 dimensions for the 
difference judgments among the 18 coders had stress values of 0.25, 0.11, 
and 0.07, respectively. The large decrease in stress between the 1 and 
2 dimensional solutions indicates that at least 2 dimensions are needed 
to account for the data. Although a 3-dimensional solution accounted 
for only a small additional decrease in stress, it offered an enhanced 
interpretation of the subjective space. (See Section VII.) 

Preference judgments. The preference data were analyzed according 
to MDPREF,!*)5 a factor analytic procedure that measures the variability 
in preference among the subjects. The proportion of the total variance 
contributed by each factor is related to the agreement among the subjects 
on the relative importance of different characteristics of the stimuli. In 
the solution for the preference judgments of the 18 coders, the first factor 
accounted for 0.89 of the variance and the second accounted for only an 
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additional 0.02, indicating strong agreement among the listeners and 
a single factor solution. Therefore, the values of the points from the one. 
factor solution were used for the scale of preference. 

Rating judgments. We computed the correlations of the ratings of 
each subject with those of each of the other subjects who listened to the 
same tape. The distribution of the correlation coefficients had a mean 
of 0.78 and standard deviation of 0.08, again showing a high degree of 
agreement. Therefore, the mean across subjects of the individual re- 
sponses, normalized so that the ratings of each subject had zero mean 
and unit variance, were used in an analysis of variance due to coder pa- 
rameters, talkers, and random orders. The 3 design variables, load 
constant, time constant, and bits, were all significant at the 0.05 level. 
The variability due to the random orders was not significant, but the 
variability due to the different talkers was significant. As in the differ- 
ence experiment, the significant talker variability was due to differences 
between the male and female talkers, accounting for only 1 percent of 
the variance. 

Rating vs. preference. The two types of quality judgments, preference 
and rating, were obtained so that the two testing methods could be 
compared. The paired comparison tapes were designed to balance many 
of the sources of variability that are artifacts of the testing procedure. 
Each coder was heard an equal number of times with each talker and 
approximately an equal number of times with each sentence. The order 
of presentation was reversed on half of the trials and, of course, the rel- 
ative merit of each coder was ultimately determined by comparing it with 
every other coder. In the rating judgments, the merit of a coder was de- 
termined by one presentation per talker. Ratings assume that the quality 
represented by the five adjectival categories are not only well defined 
for each individual, but are essentially the same for all individuals. Al- 
though the ratings were normalized before computing the analysis of 
variance, the more customary procedure is to simply average the original 
judgments across subjects. Therefore, to compare the results of the rating 
study with those of the more critical paired-comparison study, the mean 
across subjects of the original unnormalized ratings were used. 

Figure 2 shows a scatter plot of these ratings vs. transformed quality 
measures from the one factor MDPREF solution of the paired-comparison 
judgments. (The linear transformation scales the maximum and mini- 
mum measures to one and nine, respectively.) As this plot shows, the 
agreement in ratings for the two methods is extremely high: the corre- 
lation is 0.99. Thus it appears that the uncontrolled sources of variability 
that could contaminate simple rating judgments did not have a strong 
influence on the variability of these data. The additional experimental 
effort involved in collecting paired-comparison judgments in order to 
control this variability did not increase the accuracy. 
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Fig. 2—Relationship between overall evaluation of coders by category ratings and paired 
comparison preference methods. 


Vil. RESULTS 


The output of MDSCAL is a set of points, representing the coders, in 
Euclidean space. The inter-point distances are related to the judged 
differences between coders and the orientation of the points in this space 
is supposed to reveal the underlying perceptual characteristics of the 
coders. However, the distances are invariant under orthogonal rotation 
and it is often the case that a rotation of the MDSCAL coordinates is 
necessary to interpret the configuration in terms of known coder prop- 
erties. 

In evaluating MDSCAL analyses of varying dimensionality, we con- 
cluded that a 3-dimensional geometry would be most informative. We 
approached the rotation problem by using multiple linear regression 
procedures to locate vectors in the 3-dimensional space on which the 
projections of the points are maximally correlated with various objective 
measures and design parameters. We also located the vector on which 
the projections of the points are maximally correlated with the average 
subjective ratings obtained in the third experiment. The vector for each 
measure of the 18 coders was located independently. Table I displays 
some of the measures for which vectors were derived and Table II gives 
the correlations between measurements and the corresponding projec- 
tions on vectors in the MDSCAL space. These vectors are an aid to the 
interpretation of the subjective space because they make it possible to 
relate directions in space to changes in design parameters and perfor- 
mance measures. As a visual aid, the coordinate axes were rotated so that 
they nearly or exactly coincide with meaningful directions. 
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Table | 


Cc ae. Ly Fe Fs, SNR, SNReez, SNRG, SNRGgee, P, 
Oe sta dhe mca Gn de dB i a 
1 3 0.25 4 0.01 0.15 12.63 10.5 8.8 10.5 8.8 0.07 3.4 
2 3 0.25 —-4 0.03 0.04 1.20 14.8 15.7 20.3 19.0 6.69 5.7 
3 3 0.25 -10 0.21 0.03 0.18 2.9 4.5 24.9 23.7 58.28 2.4 
4 3 4.00 4 Q.11 2.51 23.99 11.1 9.0 11.1 9.2 0.11 3.2 
5 3 4.00 —-4 0.37 0.68 1.84 13.1 14.6 20.9 19.6 6.15 5.6 
6 3 4.00 -10 3.27 0.54 0.17 2.1 4.2 24.3 23.0 57.82 2.5 
7 38 32.00 4 0.79 20.13 25.54 11.1 5.5 11.2 5.9 0.60 2.3 
8 3 32.00 —4 2.89 5.45 1.89 7.9 11.2 20.7 17.8 13.03 4.4 
9 3 32.00 -10 26.16 4.39 0.17 0.7 3.6 22.4 22.9 63.79 1.6 
10 4 = 0.25 4 0.01 0.05 5.382 15.9 14.8 16.0 14.8 0.07 4.9 
11 4 0.25 -4 0.02 0.03 1.41 19.2 20.4 24.5 23.1 2.75 7.4 
12 4 0.25 -10 0.06 0.02 0.40 8.9 11.7 29.9 28.1 25.74 5.2 
13 4 4.00 4 0.08 0.94 12.53 17.1 15.5 17.2 15.7 0.11 5.2 
14 4 400 -4 0.23 0.57 2.50 17.0 19.7 25.5 23.9 2.59 7.2 
15 4 4.00 -10 0.85 0.51 0.59 7.4 10.3 29.8 27.9 24.91 6.2 
16 4 32.00 4 0.55 7.60 13.73 17.1 13.0 17.7 13.5 0.44 4.0 
17 4 32.00 -—4 1.78 4.68 2.60 11.8 16.2 25.5 22.3 6.81 6.1 
18 4 32.00 -10 6.78 4.10 0.60 3.3 9.4 28.9 27.1 34.90 3.5 
Table II 
Objective Corr. with vector 
measure values 
SNR 0.95 
SNRse 0.96 
log ARR 0.96 
SNRG 0.94 
SNRG seg 0.95 
P overload 0.98 
log T. 0.92 
log T, 0.85 
Rating 0.99 


As a further step in interpreting the space, we listened to one of the 
tapes used in the rating experiment. After hearing each sentence, the 
three of us independently wrote adjectives to describe the processed 
speech. Examples of coder descriptions are: “clear, some noise,” “slightly 
muffled, medium noise,” “crackling noise,” “very hoarse.” 

After considering several other rotations, we chose the solution dis- 
played in Figs. 3 and 4 as most interpretable because the coordinate axes 
nearly or exactly coincide with vectors of measurable quantities and the 
coder descriptions cluster in a meaningful way. With this rotation, the 
proportions of the total variance accounted for by dimensions I, II, and 
III are 0.62, 0.19, and 0.19, respectively. 

Subjective variables. When the descriptive adjectives were related 
to the configuration of points on the plane of the first two dimensions, 
shown in Fig. 3, an interpretation emerged that was reminiscent of a 
similar analysis in a study of analog circuits.!6 The interpretation of the 
space in that study indicated that listeners distinguish among the pro- 
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NOISY 
LOG A/R 


DEGRADED CLEAR 
DISTORTED 


e4 





P SNR-Gg,, 


Fig. 3—Projections of points, representing coders, and vectors, representing measures, 
on the plane of dimensions J and II. 


cessed speech samples according to whether the speech is clear or de- 
graded. When degradation is present, they further distinguish between 
noise in addition to the signal and distortion of the signal itself. The same 
interpretation applies to the first 2 dimensions in the ADPCM coder 
space. The coders we described as having clear speech and little noise 
are high on the first dimension. The coders that were described as noisy, 
muffled, and hoarse are low on the first dimension, and intermediate 
amounts of each type of reduction in overall clarity are distributed be- 
tween these two extremes. Thus, the first dimension appears to represent 
the overall clarity of the speech. 

The plane of dimensions IJ and III, shown in Fig. 4, identifies the 
‘characteristics of the speech that reduce the clarity. The coders that we 
described as noisy are high on the second dimension and those that we 
described as muffled or hoarse are low on the second dimension. Thus, 
the second dimension represents the two kinds of degradations that 
reduce the overall clarity: background noise and distortion of the speech 
signal itself. The conditions we described as very muffled sounded as 
though the speaker had his hand, or some other material object, in front 
of his mouth, and these conditions are high on dimension II. Those 
conditions we described as hoarse sounded as though the speaker had 
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Fig. 4—Projections of points, pepe Anne coders, and vectors, representing measures, 
on the plane of dimensions II an 


laryngitis and, in general, they are low on the third dimension. Two kinds 
of background noise were also identified: one that is described as 
crackling, and one that is more like the familiar white random noise. In 
general, the coders with crackling noise have low values on the third 
dimension and those with white random noise are in an intermediate 
position. Thus, the third dimension appears to represent a further dis- 
tinction between each kind of degradation that could be described as 
rough vs. smooth. Hoarse speech and crackling noise are rough or ir- 
regular in character, muffled speech is smooth or uniform, while speech 
corrupted by white noise is intermediate between these two extremes. 

Objective measures. Figures 3 and 4 also show the vectors corre- 
sponding to various objective measures. The vector SNRgeg is very close 
to the coordinate axis of dimension I and is therefore a good indicator 
of the overall clarity of the processed speech. Log A/R predicts the dis- 
tribution of points on the second dimension, interpreted as the prevalent 
kind of degradation, signal distortion or background noise. A low A/R 
produces a low step size on average, leading to slope overload, perceived 
as signal distortion. On the other hand, a high value of A/R results ina 
high average step size and high granular noise. The locations of the 
vectors SNRGseg and P are also consistent with our interpretation of the 
coordinate axes. They both have high negative weighting on dimension 
II because both reflect the predominant impairment category. High P 
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Fig. 5—Waveforms of the word “tool” for 3-bit ADPCM coders low on dimension II (signal 
distortion), with different values on dimension III, showing the relationship of attack time 
to the smooth or rough subjective descriptions. 


means high overload and substantial distortion; high SNRG;.g means 
low noise. Their non-zero weight on dimension I indicates their influence 
on speech clarity. A high value of SNRGse, indicates low background noise 
and enhanced clarity. Conversely, a high value of P is correlated with 
high distortion and thus with low clarity. The small angle between P and 
SNRGgeg reflects the fact that overload and granularity usually vary re- 
ciprocally in coders with a given number of bits per sample. 

The coordinate axis of dimension III is highly correlated with log T. 
When the attack time is very high, the step size is very slow in following 
fluctuations in input level and the speech sounds muffled. When the 
attack time is very low, the step size frequently overshoots its target value 
at the beginning of pitch periods, causing irregularity in the periodicity 
of the processed speech. This irregularity makes the speech sound 
hoarse. 

These properties are apparent in Figs. 5 to 7 which show waveforms 
that are representative of coder locations in the IJ-III plane. All of them 
display the word “tool” processed by 38-bit ADPCM coders with sub- 
stantially impaired clarity. Figure 5 shows the waveforms of distor- 
tion-producing coders (low weighting on dimension II). With —10 dB 
relative load factor they all produce substantial slope overload in steady 
state. Coder 9 (Fig. 5b) with T, = 32 msec and high weight on dimension 
III is the most muffled; fluctuations in the signal envelope are very 
heavily smoothed. Coder 6 (Fig. 5c), 7, = 4 msec, lower on dimension 
III, reproduces long-term envelope fluctuations, but smoothes out in- 
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nie 6—Waveforms of the word “tool” for 3-bit ADPCM coders high on dimension II 
(background noise), with different values on dimension III. 


dividual pitch periods. Coder 3 (Fig. 5d), T, = 0.25 msec, moderate 
loading on dimension III, reproduces pitch contours but with substantial 
time and amplitude distortion. Figure 6 shows the waveforms of the very 
noisy coders, 4, 7, and 1, with high relative load factors (4 dB) and high 
weighting on dimension II. With low distortion they all preserve the 
general envelope and time structure of the original, so that the nature 
of their impairment is best seen in oscillograms of noise voltages (coder 
output minus input). The two extreme types of noise are displayed in 
Fig. 7. Coder 4, with relatively high weight on dimension III, has 
“smooth” noise which is shown in Fig. 7b to be correlated with the long 
term envelope of the speech. In Fig. 7c, coder 1, with crackling noise 
impairment, low weight on dimension III, is seen to produce impul- 
sive-type noise correlated with the pitch contours of the signal. 
Quality prediction. The vector labeled R corresponds to the mean 
ratings on the 9-point response scale and is very close to the first di- 
mension. As indicated by Fig. 2, a vector corresponding to quality derived 
from the paired-comparison preference judgments would be in essen- 
tially the same location. Multiple regression procedures were used to 
derive linear relationships between the objective measures and subjective 
quality. Table III lists the formulas for predicting the ratings from several 
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Fig. 7—Noise waveforms relative to the word “tool” processed by two coders of Fig. 6, 
with the amplitude scale doubled, showing the influence of attack time on the types of 
noise. 


of the objective measures, singly and in combination. Prediction accuracy 
is indicated by correlations between actual and estimated ratings (1.0 
would be perfect agreement), and by the rms error expressed as a fraction 
of a point on the 9-point scale. Table III shows that, consistent with the 
locations of their vectors relative to the R vector, SNRseg (formula 2) is 
a very good predictor of subject quality and SNR (formula 1) is not a good 
predictor. Among the formulas that contain more than one objective 
measure, the most accurate predictors of subjective quality are 7 and 
8 which include separate measures of granular and overload impair- 
ments. Although the location of the vector corresponding to SNRG is 
essentially the same as that of the SNRGse, vector, prediction accuracy 
is higher when the measurement is made segmentally. 

Other coders. Since formula 8 proved an accurate estimator of the 
subjective quality of the 18 coders in the experiment, we used it to esti- 
mate subjective quality of other ADPCM coders with a wide range of 


Table Ill 

Formula for predicting rating Corr ae 

1 0.21 SNR + 2.27 0.69 1.20 
2 0.31 SNRseg + 0.89 0.93 0.63 
3 0.33 SNRseg — 0.45 log A/R + 0.79 . 0.95 0.54 
4 0.25 SNRG — 0.067 P — 0.39 log T, + 0.22 0.94 0.55 
5 0.24 SNRG — 0.077 P + 0.68 0.93 0.63 
6 0.16 SNRG — 1.11 log T, + 0.47 0.69 1.21 
7 0.24 SNRGseg — 0.078 P — 0.22 log T, + 1.00 0.96 0.45 
8 0.25 SNRGseg — 0.084 P+1.19 0.96 0.48 
9 0.14 SNRGseg — 1.03 log T, + 1.38 0.65 1.27 
10 —0.036 P — 0.26 log T, + 4.97 0.56 1.38 
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es 8—Equi-rating contours predicted by formula 8, Table III, for 4-bit ADPCM 
- coders. 


design parameters. To do so, for each bit rate, we simulated 64 coders 
that comprised all combinations of 8 load factors and 8 time constants. 
The 8 values of each parameter included the 3 tested in the original ex- . 
periment and 5 intermediate values. SNRGseg and P were measured on 
4 sentences, one by each talker, processed through each of the 64 coders. 
The quality ratings, predicted using formula 8 with the averages of the 
measures on the 4 sentences, are displayed in Figs. 8 and 9, which pertain 
to 4-bit and 3-bit coders, respectively. The equi-rating contours show 
that near-optimum quality can be expected over a surprisingly wide range 
of circuit conditions. For instance, with 4-bit coding, a rating of 6.5 (4 
point from optimum on the 9-point scale) is maintained over a7 dB range 
of load factors and a 32:1 range of time constants. 


Vill. DISCUSSION 


Perceptual characteristics. Our interpretation of the 3-dimensional 
subjective space is consistent with previous work!® on analog speech 
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impairments. In both cases, the first two dimensions have the same 
meaning. The third dimension in Ref. 16 was related to loudness. In the 
present experiment, the stimuli were equalized in level, so that subjects 
could attend to less obvious differences, like the “rough” or “smooth” 
character of the impairment. The plane of dimensions II and III, Fig. 
4, provides perhaps the most interesting view of the subjective space. 
Accounting for 38 percent of the variance in the average difference 
judgments, it represents the kind of degradation, independent of the 
amount of degradation. In this plane, the perceptually meaningful 
classifications of ADPCM impairments are the categories, “speech dis- 
tortion” and “background noise” (dimension IJ) and, in addition, the 
types of distortion, “muffled” (smooth) and “hoarse” (rough) and the 
types of noise, “continuous” (smooth) and “crackling” (rough). 

This geometric representation also confirms that the mathematical 
separation of ADPCM performance into static and dynamic response 
categories is perceptually meaningful. Dimension II is highly correlated 
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Fig. 10—Mean ratings for the 18 coders vs. SNRs showing the improvement in prediction 
by measuring SNR segmentally. The solid line is the graph of formula 2, Table III, the re- 
gression of rating on SNRseg. 


with log A/R, a measure of static performance, and dimension III is 
highly correlated with log T,, a measure of dynamic performance. 

Objective measures. Table III indicates that, as in the case of PCM,! 
ADPCM quality is accurately predicted by a linear combination of over- 
load and granularity measures. Formulas 4, 5, 7, and 8, all containing 
separate measures of overload and granularity, are among the 6 good 
predictors of average rating. The table also demonstrates the value of 
measuring signal-to-noise ratio segmentally. Formula 2, which contains 
the single measurement, SNRgeg, is also one of the 6 good predictors. 
Segmental measures give equal importance to strong and weak compo- 
nents of speech, while non-segmental SNR is essentially a measure of the 
quality of the high-level components. The strong correlation of SNRseg 
with average rating indicates that subjective quality judgments are in- 
fluenced by weak sounds as well as strong sounds. 

These properties of SNR and SNRgeg are revealed by Fig. 10 which is 
a scatter plot of average rating vs. both measures for the 18 coders. SNR 
points are labeled with crosses and SNRseg points are labeled with circles. 
The line is the graph of formula 2, the regression of R on SNRseg. The 
coders to the left of the line are, for the most part, those with low A/R, 
in which overload distortion is the predominant impairment. This dis- 
tortion affects only the strong sounds which are the ones that determine 
SNR. The good reproduction of weak sounds by overloaded coders is not 
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reflected by SNR and the crosses for these coders tend to be far to the 
left of the regression line. That is, they give an unduly poor indication 
of quality. By contrast, the points to the right of the line tend to be those 
with high A/R, coders with mainly background noise impairment. This 
degradation is particularly harmful to weak sounds and therefore its 
effect is less on SNR than on SNRseg. Consequently, SNR gives an unduly 
good indication of the quality of these coders. 

SNRseg apparently resembles Q, the objective measure of coder quality 
proposed by D. L. Richards.!” Q is an average of SNR measures per- 
formed with different input levels of a stationary signal. As such it ap- 
parently fails to take into account the dynamic response of a coder, which 
is an important aspect of adaptive quantization. We therefore speculate 
that as an estimator of subjective quality, the accuracy of Q is interme- 
diate between that of SNR and SNRyeg. 

Coder design. The relatively large distances between equi-rating 
contours in Figs. 8 and 9 show that a designer has very substantial lati- 
tude in choosing a coder with a prescribed quality rating. This finding 
is contrary to quality predictions based on conventional SNR measures, 
which indicate that only restricted sets of design parameters offer 
near-optimum performance. This newly discovered design flexibility 
could be valuable in finding coders that simultaneously satisfy criteria 
in addition to the quality of the coding-decoding process. Examples of 
such criteria are quality of tandem connections of codecs, resistance to 
transmission errors, ability to communicate voiceband data, compati- 
bility with other code formats, and economy of implementation. 
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Evaluation of a Word Recognition System Using 
Syntax Analysis 
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A speech recognition system has been implemented which accepts 
reasonably natural English sentences spoken as isolated words. The 
major components of the system are a speaker-dependent word rec- 
ognizer, a programmed grammar, and a syntax analyzer. The system 
permits formulation of complete sentences from a vocabulary of 127 
words. The set of sentences selected for investigation is intended for 
use as requests in an automated travel information system. Results are 
- presented of evaluations for speakers using their own stored reference 
patterns, the reference patterns of other speakers, and composite ref- 
erence patterns averaged over several speakers. For speakers using their 
own reference patterns the median error rate for acoustic recognition 
of the individual words is 11.7 percent. When syntax analysis is applied 
to the complete sentence, word recognition errors can be corrected and 
the error rate reduced to 0.4 percent. 


l. INTRODUCTION 


A speech recognition system composed of a programmed syntax an- 
alyzer and a speaker-dependent word recognizer has been evaluated. The 
system accepts complete sentences in which the successive words are 
spoken distinctly and in isolation. The purpose of the experiment is to 
determine the capability of syntax analysis for improving the accuracy 
of word recognition and for expanding the command ensemble of a 
voice-actuated system. 

The word recognition system, designed by Itakura,' is based on rep- 
resenting speech utterances by equally spaced frames of LPC coefficients. 
Recognition ensues from a comparison of a sample input pattern of LPC 
coefficients with an ensemble of stored reference patterns previously 
established by the designated speaker. The comparison consists of a 
frame-by-frame scan of a sample pattern against each reference pattern. 
A distance metric (or measure of dissimilarity) is calculated and accu- 
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mulated by a dynamic programming technique as the scan proceeds. The 
vocabulary item corresponding to the reference pattern with the lowest 
accumulated distance is designated the recognized item. In addition, a 
distance rejection threshold is imposed. If the accumulated distance 
exceeds the threshold at any frame during a reference scan, that par- 
ticular reference comparison is aborted. If all reference comparisons for 
a sample pattern are aborted, the result is said to be “no match” or 
“reject.” 


Il. EVALUATION OF THE ACOUSTIC ANALYZER 


An earlier evaluation of the automatic word recognition system was 
carried out over a five-month period over dialed-up telephone lines.? 
Thirteen speakers participated in that test. Each dialed the system once 
a day and provided utterances of words selected from an 84-word vo- 
cabulary. The 84-word vocabulary was designed to provide one-word 
responses to questions asked by a computer-controlled digital voice re- 
sponse system. The computer was programmed to provide airline flight 
information requested by a caller. In this system the question-answer 
dialog that takes place between the caller and the computer results in 
the specification of a category of flights for which information is desired. 
Because of the nature of this dialog, 50 of the 84 vocabulary items were 
the names of North American cities. Other entries were digits, days of 
the week, etc. In the evaluation using this vocabulary, with approxi- 
mately 750 trials per speaker, the median word error rate was 8.4 percent. 
This figure is composed of 5.7 percent rejections and 2.7 percent actual 
mismatches.* 


lll. EXPERIMENTAL DESCRIPTION 


The vocabulary selected for the present evaluation was designed to 
fulfill a similar function as that of the earlier one, namely to request flight 
information and to make reservations using an automated system with 
word-recognition capabilities. The difference is that in the present 
system the requests are made in the form of complete sentences rather 
than as one-word responses to queries. The 127-word vocabulary for this 
purpose is shown in Table I together with some sample sentence requests. 
The vocabulary contains many auxiliary and function-type words so that 
reasonably natural English sentences may be formed. The vocabulary 
includes 10 city names. In the earlier mode using the question-answer 
dialog, depending on the complexity of the task, a long series of questions 
may be necessary to specify a complete request. In the present mode, 


* Therefore, the term “word error rate,” as used in this paper, is more appropriately 
defined as the rate of nonrecognition, since it includes both outright errors (mismatches) 
and rejects. 
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Table | — 127-word vocabulary for requesting flight information 
and reservations and two sample sentences 
constructed from this vocabulary 


1 Evening 33 To 65 Reservation 97 Card 

2 Nine 34 Charge 66 A 98 Saturday 
3 October 35 Make 67 Fare 99 Pay 

4 Douglas 36 Home 68 BAC 100 By 

5 DC 37 Five 69 Departure 101 Ten 

6 Arrival 38 Does 70 Of 102 March 

7 Seattle 39 Go 71 Meal 103 Cash 

8 Eleven 40 Seat 72 Flights 104 Miami 

9 Los Angeles 41 From 73 What 105 Thursday 
10 Friday 42 Time 74 1 106 American 
11 January 43 On 75 When 107 Plane 

12 AM 44 December 76 Sunday 108 Bight 

13 April 45 June 77 Boston 109 Club 

14 May 46 Would 78 Arrive 110 Master 
15 Morning 47 Some 79 Twelve 111 Office 
16 Detroit 48 Many 80 Leave 112 My 

17 Do 49 In 81 August 113 Class 

18 New York 50 Please 82 For 114 Six 

19 At 51 Will 83 November 115 Three 
20 Tuesday 52 Lockheed 84 Philadelphia 116 Washington 
21 Oh 53 Want 85 February 117 Night 
22 Wednesday 54 Flight 86 Are 118 Phone 
23 Need 55 Four 87 There 119 Area 

24 Chicago 56 Depart 88 Return 120 Two 

25 September 57 Repeat 89 Coach 121 Code 

26 Is 58 Take 90 O’clock 122 Nonstop 
27 PM 59 Number 91 How 123 Seats 

28 Boeing 60 Denver 92 Much 124 Seven 
29 Information 61 Diners 93 Served 125 Times 
30 Afternoon 62 Prefer 94 Credit 126 Stops 

31 Express 63 July 95 The 127 First 

32 Like 64 Monday 96 One 


Sample test sentences: 

“T would like some information please.” 

“T would like one first-class seat on flight number four four to Los Angeles on Saturday 
the oh one January.” 


the efficiency of natural English is approached by combining several 
commands in a single sentence input. The second sample test sentence 
in Table I is a good example. A more complete description of the task 
domain and the grammar is found in a companion paper by Levin- 
son.? 

Seven speakers—five male, two female—participated in the evaluation. 
The system programs resided in a Data General Nova 840 computer. 
Speech was input to the system via dialed-up telephone lines from an 
ordinary handset adjacent to the computer console. The speakers spoke 
their utterances after a prompt from the console. A display scope pro- 
vided an intensity curve for their current input, together with end-point 
markers. The speakers had the option of repeating an utterance if they 
felt it was botched or corrupted by external noise disturbances. 

Two sessions per speaker were devoted to establishing reference 
patterns. In each of these sessions speakers provided a single utterance 
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of each of the 127 words in the vocabulary. Reference patterns were 
computed from these utterances. Each speaker therefore ended up with 
a reference containing two distinct reference patterns for each word in 
the vocabulary. Speakers could also provide additional optional pro- 
nunciations for the articles ‘‘a” and “the.” 

Finally, each speaker provided one or more test sessions in which a 
total of 51 specified sentences were input as strings of isolated words, 
for a total of 444 words. There was thus an average of 8.7 words per 
sentence. Every word in the vocabulary and every production rule in the 
grammar were represented in the sentence set at least once. The word 
utterances composing the sentence strings for each speaker were stored 
on disk files. 

Recognition was carried out off-line with acoustic recognition followed 
by parsed recognition accomplished by the syntax analyzer. For each 
test sentence the acoustic recognizer provided to the parser a matrix of 
distances or scores [d;;],i = 1,2,...,N, j = 1,2,...,M, where i represents 
the ith word in a sentence string of N < 22 words, and j represents the 
jth vocabulary item in the vocabulary of size M = 127 words. The smaller 
the score d;;, the closer is the acoustic match for the ith word in the 
sentence to the jth word in the vocabulary. Recognition was carried out 
under four different experimental conditions. Two conditions were ex- 
amined in which the utilized references were those for the designated 
speaker. In the first of these a fixed rejection threshold was imposed. In 
the second there was no rejection threshold. With a rejection threshold, 
an arbitrarily large number was assigned to the distance score for each 
rejected candidate. In the third experimental condition each speaker 
was compared against an arbitrarily selected reference. In the fourth 
condition each speaker was compared against a reference which was a 
composite of individual references from four arbitrarily selected male 
speakers. 


IV. RESULTS 


The overall results are shown in Table II as median error rates over 
the seven speakers. 

“Word error—acoustic best candidate” refers to the rate at which the 
specified test word was not the best acoustic candidate. ‘““Word error— 
acoustic five best candidates” refers to the rate at which the specified 
test word was not included among the five best acoustic candidates. The 
recognition scores for word error—acoustic best candidate are quite 
comparable to those obtained in the earlier evaluation. Given the larger 
size of the vocabulary, and especially the greater frequency of common, 
more easily confused words, the 11.7 percent word error performance* 


* Compared to 8.4 percent for an initial trial in the earlier study with an 84-word vo- 
cabulary. 
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Table Il — Median error rates over five speakers with 
51 test sentences per speaker 


3 4 
1 2 Arbitrary Composite 
Condition: Designated Designated male of four male 

Reference: speaker speaker speaker* speakers* 
Rejection threshold: Fixed None Fixed Fixed 
Word Acoustic best candidate: 11.7% 10.8% 45.5% 34.9% 
Acoustic five best candidates: 1.8% 1.1% 20.0% 20.0% 
error | Parsed: 0.4% 1.6% 5.6% 6.5% 
Sentence error parsed: 3.9% 5.9% 35.3% 37.2% 


* Scores include female speakers using male references. 


seems reasonable. As anticipated, the syntactic constraints imposed by 
the task language have a powerful correcting influence on acoustic word 
errors. For example, for condition 1 the median number of word errors 
was reduced from 52 to 2 out of a total of 444.7 Since a single word error 
creates a sentence error and since the number of sentences in the sample 
is relatively small, the parsed sentence error rate is not as reliable an 
indicator of the improvement gained by parsing as the parsed word error 
rate. It is interesting to note that although the acoustic word error rates 
are about the same, with or without a rejection threshold, the parsed 
word and sentence error rates are somewhat larger for the no-rejec- 
tion-threshold condition. We attribute this result to the following pos- 
sible situation. If aspecified word in a sentence string is poorly recognized 
acoustically, in the condition with a rejection threshold it will have the 
same arbitrarily large distance score as other rejected candidates. 
Without a rejection threshold, however, the true word may have a score 
which is considerably worse than other candidates resulting in a greater 
chance of misleading the parser. 

The recognition system was not designed to be speaker-independent. 
We did, however, try a naive experiment in that mode. Table II also 
shows the results of comparing all speakers against an arbitrary refer- 
ence, condition 3, as well as a comparison of all speakers against a com- 
posite reference, condition 4. It was anticipated that, although acoustic 
recognition would be considerably poorer for these conditions than for 
the speaker-dependent condition, the parser would be able to compen- 
sate to some extent this poor performance, resulting in a reasonable 
overall recognition performance. This seems to be true with a 5 or 6 
percent parsed word error rate and 35 or 37 percent parsed sentence error 
rate. Comparing speakers against an arbitrary reference does not seem 
significantly different from comparing them against a composite refer- 
ence. The performance of the two female speakers.in both these condi- 


tTL.e., a reduction of word error rate from 11.7 percent to 0.4 percent! 
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Fig. 1—Individual error rates for the speaker-dependent condition. 


tions was considerably worse than that of the male speakers. The parsed 
word error rate for the women, for example, was approximately 30 per- 
cent.* 

Individual error rates for the speaker-dependent condition are shown 
in Fig. 1. Most striking is the contraction of a large range of acoustic word 
error rates (best candidate) to a very tight range of parsed word error 
rates all below 2 percent. Parsed sentence error rates vary over a wide 
range and are sensitive functions of parsed word error rates. An addi- 
tional indicator of acoustic word performance is acoustic word error rate, 
five best candidates. These rates occupy a considerably reduced and 
overall lower range than the standard acoustic word error rates. This 
measure may be a more reliable predictor of parsed error rates, as shown 
by the monotonic character of the lines that connect these individual 


* The female speakers, therefore, contribute substantially to the error scores for con- 
ditions 3 and 4. This is not surprising in that the references for 3 and 4 were derived from 
male speakers only. 
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WORD AND SENTENCE ERROR RATES 
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Fig. 2—Parsed error rate as a function of acoustic error rate for both words and sen- 
tences. 


rates with the parsed rates. This seems reasonable since the five- 
candidate rate gives a good measure of the quality of the acoustic rec- 
ognizer in the sense of indicating whether the true word has a good 
chance of having a low score. 

Finally, individual parsed error rates collected from all speakers and 
conditions are plotted versus individual acoustic word error rates (best 
candidate) in Fig. 2 to characterize the effectiveness of the parser over 
the widest possible range of performance. This figure is analogous to the 
one in the companion paper by Levinson? which shows simulated results. 
The trends in both figures are the same, but the parsed error rates are 
significantly greater functions of acoustic error rate for the actual rec- 
ognizer than for the simulation. The solid curves drawn have been fitted 
by eye. Note that although there is considerable scatter among the in- 
dividual parsed sentence error rates the trend is almost linear. It is evi- 
dent that even though the parser is a highly effective corrector of acoustic 
word errors this beneficial effect is neutralized to some degree by the 
highly sensitive dependence of sentence error on word error. 


V. CONCLUSION 


The command ensemble for an automatic word recognizer can be 
greatly expanded by forming complete sentences from a relatively 
modest word vocabulary. For applications where speaking discipline can 
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be exercised, complete sentences can be input on a word-by-word basis. 
The present study demonstrates that realistic syntactic constraints can 
dramatically compensate acoustic errors by the use of a well-constructed 
parser. 
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The Effects of Syntactic Analysis on Word 
Recognition Accuracy 


By S. E. LEVINSON 
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In this paper we examine the effects of an algorithm for syntactic 
analysis on word recognition accuracy. The behavior of the algorithm 
is studied by means of a computer simulation. We describe the syntactic 
analysis technique, the problem domain to which it was applied, and 
the details of the simulation. We then present the results of the simu- 
lation and their implications. We find, for example, that an acoustic 
word error rate of 10 percent is reduced to 0.2 percent after syntactic 
analysis, resulting in a sentence error rate of 1 percent. These figures 
are based on a 127-word vocabulary and an average of 10.3 words per 
sentence for 1000 sentences. We expect that these results are indicative 
of the performance which will be attained by a real speech recognition 
system which uses the syntactic analysis algorithm described 
herein. 


I. INTRODUCTION 


The utility and flexibility of a speech recognition system can be sub- 
stantially expanded if it can accept sentence length utterances rather 
than single words. Simultaneously, accuracy can be greatly improved 
by exploiting the grammatical constraints of language on the input 
sentences.!:2.3 

The purpose of this investigation is to establish, by means of a com- 
puter simulation, how much improvement in reliability can be obtained 
by using a particular optimal method for syntactic analysis in conjunc- 
tion with an isolated word recognition system. 

This paper is in five sections. First we give a description of the method 
of syntax analysis under consideration. In the second section we describe 
the content and the semantic and grammatical structure of the problem 
domain to which we intend to apply the analysis. The third section is 
devoted to a description of the simulation, particularly of the acoustic 
recognizer and the procedure for generating random sentences. In the 
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Fig. 1—Block diagram of the speech recognition system. 





fourth section we present the results of the simulation. We conclude with 
an evaluation of the results and a brief discussion of directions for further 
investigations. 

It should be said at the outset that the results of this experiment are 
encouraging. We find that, by using the grammatical constraints for our 
problem domain, the syntax analyzer reduces the acoustic error rate from 
10 percent to 0.2 percent. This results in a sentence error rate of 1 per- 
cent. The sentences were composed from a 127-word vocabulary and 
contained an average of 10.38 words per sentence over 1000 randomly 
generated sentences. 

We believe that these figures, with the more detailed results given in 
Section IV, are indicative of those which will be attained when the 
method of syntax analysis described herein is incorporated into a real 
speech recognition system. 


ll. AN OPTIMAL ALGORITHM FOR SYNTAX ANALYSIS 


The type of speech recognition system we are evaluating is shown in 
Fig. 1, and its operation may be formally described as follows. 

Let the language, L, be the subset of English used in a particular 
speech recognition task. Sentences in L are composed from the vocab- 
ulary, V, consisting of the M words vj,UV9,. . .vyy. Let W be an arbitrary 
sentence in the language. Then we write W € L and 


W = wyWo-++Wp (1) 


where each w; is a vocabulary word which we signify by writing w; € V 
for 1 <i <k. Clearly W contains k words, and we will often denote this 
by writing | W| = &. Similarly the number of sentences in L will be de- 
noted by |L|. 

The sentence W of eq. (1) is encoded in the speech signal x(t) and 
input to the acoustic recognizer from which is obtained the probably 
corrupted string 


W = Wyto--+ wy (2) 


where w; € V for 1 <i <k but W is not, in general, a sentence in L. 

The acoustic recognizer also produces the matrix [d;;] whose ijth entry, 
d;;, is the distance, as measured by some metric in an appropriate pattern 
space, from the ith word, Ww; to the prototype for the jth vocabulary word, 
vj, forl<i1<kandi<j <M. 
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Fig. 2—Example state transition diagram. 


The syntax analyzer then produces the string 
W = Wytdo-++ Wy (3) 
for which the total distance, D(W), given by 


_ Rk 
D(W) = & dy, 1<j;<M (4) 
i=1 


is minimized subject to the constraint that W e L. Thus the syntax 
analysis is optimal in the sense of minimum distance. 

Since, in general, W ¢ L, whereas W was assumed to be grammatically 
well-formed (i.e. W € L), the process should correct word recognition 
errors. 

In principle one could minimize the objective function of eq. (4) by 
computing D(W) V W e€ L and choosing the smallest value. In practice, 
when |L| is large, this is impossible. One must perform the optimization 
efficiently. It has been shown by Lipton and Synder‘ that for a particular 
class of languages one can minimize D(W) in time proportional to | W|. 
In fact one can optimize any reasonable objective function in time linear 
in the length of the input. 
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The particular class of languages for which the efficiency can be at- 
tained is called the class of Regular languages. For the purposes of this 
discussion we shall define the class of Regular languages as that class for 
which each member language can be represented by an abstract graph 
called a state transition diagram. 

A state transition diagram consists of a finite set of vertices or states, 
Q, and a set of edges or transitions connecting the states. Each such edge 
is labeled with some vu; € V. The exact manner of the interconnection 
of states is specified symbolically by a transition function, 6, where 


6:(Q X V) > Q (5) 


That is, if a state g; € @ is connected to another state q; € Q by an edge 
labeled v,, € V then 


6(Gi,Um) = Gj (6) 


We also define a set of accepting states, Z Cc Q, which has the significance 

that a string W = wyw2---w,, where w; € V for 1 <i < &, is a well- 

formed sentence in the language, L, represented by the state transition 

diagram if and only if there is a path starting at q; and terminating in 

some q; € Z whose edges are labeled, in order, w1,Wg, «++ Wp. 
Alternatively we may write W e L iff 


61(q1,W1) = Gj, 
62(qj,W2) = jo 


: (7) 


bn (Qjp-pWk) = Qj, € Z 


We may then define the language, L, as the set of all W satisfying eq. 
(7). An example of these concepts is shown in Fig. 2. The accepting states 
are marked by asterisks. 

While the definition given above of a Regular language is mathema- 
tically rigorous, it is not the standard one used in the literature on formal 
language theory but rather has been specifically tailored to the notational 
requirements of this paper. The interested reader is urged to refer to 
Hopcroft and Ullman!° for a standard and complete introduction to 
formal language theory. 

In the following discussion we shall restrict ourselves to finite Regular 
languages, i.e., those for which |L| is finite. This restriction in no way 
alters the theory but its practical importance will become obvious in what 
follows. The finiteness of the language implies that its state transition 
diagram has no circuits, i.e., no paths of any length starting and ending 
at the same state. Thus there is some maximum sentence length which 
we shall denote, | max. 
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We now turn to the problem of efficiently solving the minimization 
problem of eq. (4). To do this we shall define two data structures ® and 
W which will be used to store the estimates of D (W) and W, respec- 
tively. 

The first stage of the algorithm is the initialization procedure in which 
we set 


0 forg=qoandi=0 
B:(q) = | a 


W;(q) =0 1<i<|W|=k; VaeEQ (8) 


The data structures have two indices. The subscript is the position of 
the word in the sentence and the argument in parentheses refers to the 
state so that the storage required for each array is, at most, the product 
of lmax + 1 and |Q|, the number of states in the set Q. 

After initialization we utilize a dynamic programming technique de- 
fined by the following recursion relations: 


o otherwise 


®;(q) = min {B;-1(Qp) + dij} (9) 
where the set A is given by: 
A = {5(qp,vj) = q} (10) 
Then 
Wi(q) = Vi-1(qp) dj (11) 


where w; is just the v; which minimizes ®;(q). Equation (11) is under- 
stood to mean that the word w; is simply appended to to the string 
Wj-1 (q Pp ). 

Unfortunately the concatenation operation is not easily implemented 
on general purpose computers so we change the recursion of eq. (11) by 
making V into a linked list structure of the form: 


WVii(q) = Ip 
Voi(q) = v; (12) 


Then when i = k we can trace back through the linked list of eq. (12) and 
construct the sentence W as follows: First find q; € Z such that 


; (qs) = min {#;(q)} (13) 
qeZ 
set q = qf and then fori =k,k — 1,k — 2,...,1 
wb; = Voi(q) 
q = Wii(q) (14) 
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Table | — Example [d,] matrix 


Code — Vocabulary word IT=1 IT=2 I=3 IT=4 I=5 
1 Is 9 9 1 8 4 
2 Fare 2 2 5 3 1 
3 I 7 3 2 3 2 
4 Want 2 9 4 7 3 
5 Would 1 5 4 8 2 
6 Like 2 5 2 6 5 
7 Some 2 1 9 8 3 
8 Information 7 7 4 7 8 
9 Please 2 3 2 4 9 

10 0 4 5 8 1 7 
ll Make 6 3 9 8 5 
12 A 4 7 6 9 8 
13 Reservation 3 6 7 8 9 
14 Return 9 7 6 4 8 
15 The 8 6 5 2 3 
16 Morning 3 4 5 6 7 
17 First 8 6 8 7 5 
18 Class 5 5 4 3 9 
19 Seat 9 9 8 7 3 
20 Non-stop 3 3 4 5 8 
21 Flight 9 8 3 5 6 
22 Will 6 7 7 6 5 
23 Pay 4 4 4 4 3 
24 In 3 3 3 6 9 
25 Cash 5 4 3 7 6 
26 How 2 9 8 7 5 
27 Much 6 2 8 4 9 
28 Need 7 6 5 4 3 


Thus the sentence W is computed from right to left. 

The operation of the above algorithm is illustrated in Tables I, II, and 
III. Table I shows the vocabulary words of the language diagrammed in 
Fig. 2 along with numerical codes and a sample [d;;] matrix. Table II 
shows the details of the operation of the algorithm for i = 0,1,2. Table 
III shows the results after the sentence has been completely analyzed. 

By locating the smallest entry in each column of the sample [d;/] 
matrix of Table I, it can be seen that the acoustic transcription of the 
sentence from which this matrix was produced is: WOULD SOME IS TO 
FARE. Clearly this is not a valid English sentence nor is there any path 
through the state transition diagram of Fig. 2 whose edges are so la- 
beled. 

Following Table II the reader can trace the operation of the algorithm 
as it computes the valid sentence having the smallest total distance. First 
the @ and W arrays are initialized according to eq. (8). To make the figure 
easier to read, this has been shown only fori = 0. 

Note that there are two transitions from state 1; one to state 2 labeled 
Tand the other to state 8 labeled HOW. Accordingly ,(2) is set to 7, the 
metric for I; ¥1;(2) is set to 1, the state at the beginning of the transition 
and Wo, is set to 3, the code for the transition label I. Similarly ;(8) is 
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Table Il — Detailed operation of the first three stages of the 
algorithm WANT 






Position 
Code Word 1 
3 I 7 3 
4 Want 2 9 
28 Need 7 6 
5 Would 1 5 
22 Will 6 7 
26 How 2 9 
27 Much 6 2 
8 





SBenwoo 8B 


Table Ill — Final results of the algorithm 





set to 2, the metric for HOW; V1;(8) is set to 1 as before and V2; (8) is set 
to 26, the code for HOW. All other entries remain unchanged. 

In the next stage more transitions become possible. Note in particular 
that there are two possible transitions from state 2 to state 3. In accor- 
dance with eq. (9), the one labeled NEED is chosen since it results in the 
smallest total distance, 13, which is entered in &9(3); V1(3) is set to 2, 
the previous state, and W99(3) is set to 28, the code for NEED. Transitions 
to state 7, 9, and 22 are also permissible and thus these columns are filled 
in according to the same procedure. 
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The completion of this phase of the algorithm results in @ and W as 
shown in Table III. We can now trace back according to eqs. (13) and (14) 
to find W. From Fig. 2 we see that there are two accepting states, 5 and 
6. &;(6) is 8 which is less than 30, the value of @5(5), so we start tracing 
back from state 6. The optimal state sequence is, in reverse order, 
6,11,10,9,8,1. The corresponding word codes which, when reversed, de- 
code to the sentence: HOW MUCH IS THE FARE. 

One final note: from the operation of the algorithm it should be clear 
that it is not necessary to retain $;(q) for.0 < i < |W|. At the ith stage 
one needs only $;_;(q) to compute ®;(q). Thus the storage requirements 
are nearly halved in the actual implementation. 

In closing we should note that this scheme is formally the same as 
(though conceptually different from) the Viterbi® algorithm and similar 
to methods used by Baker® and stochastic parsing techniques discussed 
in Fu’ and Paz.® The crucial difference is that in the cited references, 
estimates of transitions probabilities are used whereas in this method 
the transitions are deterministic and the probabilities used are only those 
conditioned on the input x(t). 


Il. THE SEMANTIC AND GRAMMATICAL STRUCTURE OF THE 
RECOGNITION TASK 

In this section we shall discuss the application of the abstractions of 
the previous section to a particular speech recognition problem domain. 
The task which was finally selected was that of an airline information 
and reservation system. The choice was made for three reasons. First, 
the problem is difficult enough so that even under some artificial con- 
straints, it is a significant test of the above described techniques. Second, 
previous work by Rosenberg and Itakura® which used single words rather 
than sentences composed of isolated words as input was available for 
purposes of comparison. Third, it affords the opportunity to add modes 
of human/machine communication such as speaker verification and voice 
response. 

The semantics of the language we designed limits a user to the fol- 
lowing types of messages. First, one may state the desired kind of 
transaction (i.e., requesting flight information or making a reservation). 
Then, one may make a reservation either by providing all necessary in- 
formation in one sentence or by giving answers to questions as required. 
The user may select arrival and/or departure dates, times and cities, 
number of stops, number and class of seats, specific flight numbers and 
aircraft types. Alternatively, the user may ask questions about arrival 
and/or departure dates, times and locations of specific flights, the type 
of aircraft, the number of stops, the fare, the number of meals served and 
the flight time. Finally, he may request a repeat of any information or 
supply telephone numbers and method of payment. 
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For most of the above messages there are several acceptable gram- 
matical structures of sentences conveying the same semantic informa- 
tion. 

In order to limit the complexity of the syntactic analysis, certain ar- 
bitrary constraints were imposed: 

(1) Dates consist of two digits followed by the name of the month. 

(it) Flight numbers are limited to one or two digits. 

(iti) The vocabulary includes the names of only ten cities. 

(tv) The length of the longest sentences is 22 words. 

It was felt that these constraints could all be relaxed if so desired without 
making major system modifications. 

Next we give an informal specification of the syntactic structure. In 
this description phrases enclosed in curly brackets are alternatives. 
Those enclosed in square brackets are optional, while those within angle 
brackets represent a class of words of the indicated type. 


| WANT r SOME INFORMATION | fe isaay 
WOULD LIKE] | TO MAKE A RESERVATION] /P 
GO 
WANT LEAVE ; 
I [FROM (city) ][TO (city)] 
WOULD LIKE RETURN 
DEPART 
MORNING 
AFTERNOON 
[on][(day) ] [THE (date)] 
EVENING 
NIGHT 
AT WHAT TIME . : 
| DO FLIGHTS LEAVE [(city)] FOR (city) 
WHEN 
ARE THERE ; 
HOW MANY FLIGHTS | | [FROM (city) ] 
MORNING 
. AFTERNOON 
TO (city) ON [day] [THE (date) | 
EVENING 
NIGHT 


WHAT PLANE IS ON FLIGHT (flightnumber) [TO (city) | 


WHAT 


IS THE FARE [FROM (city)] [TO (city) ] 
HOW MUCH 


IS A MEAL SERVED ON [THE] FLIGHT [(flightnumber) |[TO (city) | 
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WANT 
I { WOULD LIKE) FLIGHT [NUMBER] (flightnumber) 


WILL TAKE 
MORNING 
; AFTERNOON 
[TO (city)] ] ON (day) [THE (date) ] 
NIGHT 
EVENING 
WANT 
aS: FIRST CLASS 
I / NEED (digit) | || SEAT (s]. 
COACH 
WOULD LIKE 


I PREFER THE [(manufacturer)] (aircraft type). 


a.m. 
WANT 
I TOGO AT (hour) {p.m. 
WOULD LIKE 
O’CLOCK 
(flightnumber) 


FARE 
ARRIVAL TIME [s] 
PLEASE REPEAT THE { DEPARTURE 


PLANE 
NUMBER OF MEALS 
AT WHAT TIME EN DaSaee! 
| DOES FLIGHT (flightnumber) 
WHEN 


FROM : ARRIVE 
[ro |‘ |lopare 
TO DEPART 
CASH 
AMERICAN EXPRESS 


I WILL PAY BY 
DINERS CLUB 
MASTER CHARGE 
HOME 
Y | PHONE [NUMBER] IS [AREA CODE] 
OFFICE 


[<area code number)] (phone number) 


| WANT 


A NON-STOP FLIGHT 
WOULD LIKE 
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HOW MANY STOPS ARE THERE ON FLIGHT (flight number) 
MORNING 


AFTERNOON 
fees (city) ON (day) [THE (DATE) | 
TO EVENING 


NIGHT 


WHAT IS THE FLIGHT TIME [FROM (city)] [TO (city)] 


The above description is too informal to define every detail of the 
Flight Information Language. It should, however, give the reader a 
feeling for the basic syntax and semantics. This specification of the 
language is quite useless for the purpose of the syntax analysis algorithm. 
For that purpose we have produced a formal specification of the lan- 
guage, the state transition diagram for which is shown in Fig. 3. From 
this graph it may be seen that | V| = 127; |6] = 450; |Q| = 144 and |Z| 
= 21 with the accepting states being designated by asterisk. 


IV. DETAILS OF THE SIMULATION 


The simulation of the speech recognition system based upon the 
analysis described above may be treated as three separate problems. 
They are: (1) Random generation of many well-formed sentences, (ii) 
computing a [d,;] matrix for each in such a way that the number of 
acoustic errors resulting from a nearest-neighbor decision rule is con- 
trollable, and (iii) syntactically analyzing the sentences and tabulating 
the appropriate statistics automatically. We shall now discuss these in 
order. 

The method for generating random sentences in the language is just 
the following algorithm: | 

(() W<-O;q; < q1 (W gets the null string) 

at the ith stage, 
(1) Chose a word, wz; € V 
(iti) If 6(q;,we;) ¥ qj; for some Jj go to (2) 
else W <- W wz, (W gets itself concatenated with W;,) 


gi ~~ qj 
(iv) Ifg € Zand p < 6 where 
p is a pseudorandom number and @ is some threshold, STOP; else 
go to (2) 
When the procedure terminates, W = w,;,Wz,. . We; | < Imax. Obviously 
by changing the threshold, 0, one can vary the average length of the 
sentences produced. In the actual simulation, p was uniformly distrib- 
uted on (1/4, —4) and 6 was set to —0.25 producing an average sentence 
length of 10.3 words. 
In all, 42,000 such sentences were generated. In addition we made up 
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Fig. 3—State transition diagram of the Flight Information and Reservation Lan- 
guage. 
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Fig. 3 (continued) 


an additional 171 sentences averaging 9.7 words in length to use as a 
check against the randomly generated sentences. The additional sen- 
tences comprised several realistic transactions between airline customers 
and an automated flight information and reservation system. 
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The procedure for generating a [d;;] matrix simulating one which 
might be produced by acoustic recognition for the randomly selected 
sentence W = w w2:-- Ww; is as follows. Corresponding to each v; € V 
we assign the five-dimensional Gaussian density function 


pj(&) = (2x)-5/2] U|-V/2e- V2 jr)? (15) 


where the covariance matrix, U, was chosen, for convenience, to be 


oO 
o 0 
U= 0 o (16) 
2 
my) 
for selected values of o2. The mean vectors 
mj a (m1;,M9j,1M3;,7M4;,M5;) 
were fixed by selecting the mj;; at random from 
2 
mj= {1 forl</1<5;1<j <|V] (17) 
0 


until the 127 mean vectors were defined. Then for w; = v;, a random 
vector ¥ was drawn from p;(<) and distances were computed according 
to: 

d= ||5— jl | forl<j<|VI (18) 


where the norm is the simple Euclidean distance. Equation (18) was 
evaluated for 1 <i <& thus all entries in the [d;;] matrix were computed 
for each sentence W. 
The acoustic recognition was simulated by a nearest neighbor rule so 
that w; = v; if 
di; <di, forl<n< |V| (19) 


Again, eq. (19) was applied for 1 <i <2 and ties were arbitrarily broken. 
Clearly by changing the value of o? in eqs. (15) and (16) the simulated 
acoustic error rate can be varied with small values of o? producing low 
error rates. 

Given the foregoing discussion, description of the simulation is quite 
simple. A set of 1000 random sentences was generated and its distance 
matrices computed. The sentences were syntactically analyzed and errors 
counted. This was done for 0.05 < o < 2.1 with o being incremented by 
0.05 for each set of 1000 sentences. 

In addition, the specially formulated 171 sentences were typed in and 
processed. In this case o was fixed at a value of 0.245 which resulted in 
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Fig. 4—Error rates as a function of o, 


-an acoustic error rate of 11 percent which is close to the value observed 
by Rosenberg and Itakura® for a similar word recognition task. Error 
rates were measured for these sentences as well. 


V. SIMULATION RESULTS 


The overall results of the simulation are encouraging, showing con- 
siderable improvement in word and sentence recognition accuracy. Given 
an acoustic error rate of 10 percent on 1000 randomly generated sen- 
tences averaging 10.3 words per sentence, syntactic analysis reduces the 
word error rate to 0.2 percent resulting in a sentence error rate of 1 
percent. A sentence is in error if even one word is improperly classified. 
For the test set of 171 sentences containing 1662 words, an 11 percent 
acoustic word error rate was lowered to 0.2 percent after syntactic 
analysis resulting in a 1.2 percent sentence error rate. The actual time 
required to analyze a 22-word sentence on the Data General Nova 840 
is a small fraction of a second. 

Details of the results are best given in the accompanying figures. 
Figure 4 is a plot of acoustic and syntactic word error probabilities as a 
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Fig. 5—Word and sentence error rates as a function of acoustic error rate. 


function of o of eqs. (15) and (16). Each point marked with a symbol is 
an observed data point based on the results obtained from 1000 sen- 
tences. It should be noted that the results for each point are based upon 
different sentences. The solid lines are obtained from a nonlinear 
least-squares fit of the data to the equation 


a ao 
= : 20 
c agag + (ay — agag)e~@1° (20) 


The method used in fitting the data is described by McCalla.1! The curve 
of eq. (20) is called a logistic curve; its significance is discussed in detail 
by Braun.!? For the simulated data the standard deviation of the actual 
data from the fitted curve was <0.005. 

Figure 5 shows the word and sentence error probabilities as a function 
of the acoustic word error probability. Once again the marked points are 
derived from sets of 1000 randomly generated sentences while the solid 
curves are obtained by fitting the data to an exponential curve of the 


form 


Pep = ayer2Pe (21) 
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Once again the resulting fits were good having a standard deviation 
of <0.004. 


VI. CONCLUSIONS 


It is clear from the results of the simulation that the syntax analysis 
algorithm is very fast and effective in eliminating word recognition errors 
which occur at the acoustic level. It is expected that the performance of 
the algorithm for real speech input will depend on the characteristics 
of the acoustic recognizer and the task language. However, we believe 
that our results are indicative of the performance which can be attained 
in real speech recognition systems. 

There are several areas for further research which are immediately 
suggested by this work. Although this entire presentation has been ori- 
ented toward the grammatical structure of sentences, the method de- 
scribed is certainly not restricted to that area. For example, the phonemic 
structure of words can be specified by a formal language as can the 
composition of acoustic features into phenomes. We therefore feel that 
optimal syntax analysis methods will be useful in more difficult speech 
recognition tasks than the one described here. 

Another useful extension of the technique would be achieved by re- 
taining the same optimality criterion while relaxing the restriction that 
|W| = |W|. In other words, the algorithm would be allowed to insert and 
delete words. This could be an important aid in the solution of the seg- 
mentation problem in continuous speech. 

On the theoretical side, it would be enlightening to derive analytical 
expressions for the average probability of error for the syntax analyzer, 
given the properties of the language and a characterization of the acoustic 
recognizer. Perhaps for this purpose the entropy or redundancy of the 
language might be sufficient, while the acoustic recognizer might be 
viewed as a noisy channel and characterized by its equivocation or ca- 
pacity. In any event, it seems obvious that an information theoretic 
analysis would provide insights into the behavior of speech recognition 
systems. 

Finally we note that Regular languages are the most simple syntactic 
structures. One naturally wonders whether efficient, optimal methods 
exist for formal languages of much greater complexity which would be 
better models of Natural Language. 

In summary, we may say that optimal syntactic analysis techniques 
are useful and powerful tools to be used in tractable speech recognition 
tasks as well as being interesting mathematical objects. 
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A Combinatorial Lemma and Its Application to 
Concentrating Trees of Discrete-Time Queues 


By J. A. MORRISON 


(Manuscript received November 11, 1977) 


Concentrating rooted tree networks of discrete-time single server 
queues, all with unit service time, are considered. Such networks occur 
as subnetworks connecting remote access terminals to a node in a data 
communications network. It is shown that the network of queues may 
be replaced by a single queue, with prescribed input, which has the same 
output as the queue at the root of the tree. The result is applied, in 
particular, to the case of several queues in tandem, and it is shown how 
this problem may be reduced to that of just two queues in tandem. The 
latter problem was analyzed earlier by the author. 


I. INTRODUCTION 


In this paper we consider concentrating rooted tree networks of dis- 
crete-time single server queues, all with unit service time. Such networks 
occur as subnetworks connecting remote access terminals to a node in 
a data communications network.! Our purpose is to show that the rooted 
tree network of queues may be replaced by a single queue, with pre- 
scribed input, which has the same output as the queue at the root of the 
tree. In particular, the result is applied to the case of queues in tan- 
dem. 

In Section II we consider the pooling of data from M buffers into a 
single buffer, which also receives data from another source, as depicted 
in Fig. 1. We establish a combinatorial lemma which shows that there 
is a single equivalent buffer, with prescribed input, and the same output 
as the buffer in which the data is pooled. It is then pointed out how this 
result may be applied to a concentrating rooted tree network of queues, 
such as the one depicted in Fig. 3. A related observation was made by 
Kaspi and Rubinovitch? in connection with networks of continuous time 
queues involving the pooling of data from inputs with idle periods that 
are exponentially distributed. 
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Fig. 1—Schematic of pooling of data from several buffers into a single buffer. 


In Section III we consider the repeated application of the lemma to 
the case of several queues in tandem, as depicted in Fig. 4. It is shown 
how this problem may be reduced to that of just two queues in tandem, 
so that the results obtained for that problem? may be applied. Specifi- 
cally, in the case that the input processes 2, i = 1,..., 7, are mutually 
independent, and each process is a sequence of independent identically 
distributed nonnegative integer valued random variables, the generating 
function of the steady state distribution of the content of each buffer 
in Fig. 4 may be determined. Also, under the assumption that all arrivals 
take place at the end of a unit time interval, the average waiting time in 
each queue may be obtained. 


Il. COMBINATORIAL LEMMA 


We first consider the pooling of data from M buffers into a single 
buffer, which also receives data from another source, as depicted in Fig. 
1. It is assumed that a buffer transmits one packet, the basic unit of data, 
in a unit time interval, provided that it is not empty, and that the buffers 
are of unlimited size. Let bY, ] = 1,..., M,denote thecontents of the 
M buffers at time n, and let x¥) denote the corresponding number of 
packets entering the buffers in the time interval (n,n + 1]. We define 


1, £2 1,2,..5 
0, 2=0. 


Then the contents of the buffers at time (n + 1) are given by the equa- 
tions 


U(é) = | (1) 


b= by SUG) rans, J= hese Ms (2) 


forn = 0,1,2,.... It is assumed that the initial contents by”, as well as 
the inputs x, are nonnegative integers. 
The outputs of the M buffers enter another buffer, the content of 
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which at time n is denoted by cp. Also, the number of packets entering’ 
this other buffer in the time interval (n,n + 1] from another source is 
denoted by y,. Then the content of this buffer at time (n + 1) is given 
by the equation 


M : 
Cnt+1 = Cn — U(en) a U(b%) + Yn, (3) 
j=l 


forn =0,1,2,....Itis assumed that the initial content co, as well as the 
inputs yy, are nonnegative integers. We now show that there is a single 
equivalent buffer, with prescribed input, which has the same output. 

Let e, denote the content of the equivalent buffer at time n, and de- 
fine 


€o = Co, 

M_. (4) 

en = > [DP —x9 J t+en, n=1,2,.... 
j=l 


Further, we define the inputs 


M 3 
wo = > by + yo, 
i= (5) 
M ; 
Wn = 2 tea fe Vp WPS eet 
= 


Then we have the following 
Lemma 1. Subject to (1)-(5), en is a nonnegative integer, and 


U(en) = Ulen), 


(6) 
Cnt+1 = en — Ulen) tun, n=0,1,2,.... 
Proof: It follows from (2)—(4) that 
M_. 
eri be hea Ue) Pin. WH 042 Gate (7) 


j=l 

But, from (1), 4 — U(2£) = 0. Hence e,,+ 1 is a nonnegative integer for n 
= 0,1,2,..., and so is €9 = Co, by assumption. Moreover, e,+1 = 0 implies 
that bY =0,j=1,...,M,c, = U(c,) and y, = 0, and hence, from (3), 
that c,4) = 0. On the other hand, c,,+1 = 0 also implies that bY = 0,7 = 
1,...,M,c, = U(e,) and y, = 0, and hence, from (7), that e,41 = 0. 
Therefore U(e,41) = U(en41), n = 0,1,2,..., and U(eo) = U(co) since 
€o = Co. Finally, from (7), with the help of (4) and (5), 


€n+1 = en — Ulen) tun, n=0,1,2,... _ (8) 
Since we have just shown that U(e,) = U(c,), n = 0,1,2,..., this com- 


pletes the proof of the lemma. 
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Fig. 2—(a) Queue, Q, fed by two independent input streams, and (b) Queue, Qs, fed by 
the outputs of two queues, Q; and Qo, which are fed separately by the two input 
streams. 


STREAM 2 


In the particular case M = 2 and y,, = 0, Ziegler and Schilling* obtained 
arelated result, not. restricted to discrete-time queues. They considered 
single server queues with identical constant service times, but assumed 
that the interarrival times between packets for each of the two inde- 
pendent input streams were governed by some general probability dis- 
tribution. They compared a queue, Q, fed directly by the two input 
streams, and a queue, Qs, fed by the outputs of two queues, Q; and Qo, 
which are fed separately by the two input streams, as depicted in Fig. 
2a and b. They established that the number of packets serviced at Q 
during its jth busy period is equal to the number serviced at Q3 during 
its jth busy period, and hence that the jth idle periods at Q and Q3 have 
the same duration. Note that in the discrete-time case we have shown 
that U(e,) = U(c,), so that the corresponding buffers are empty at the 
same times. 

Returning to our lemma, the result may be applied to concentrating 
rooted tree networks of discrete-time single server queues with unit 
service time, such as the network depicted in Fig. 3. The queues Q1, Qe 
and 3 may be replaced by a single equivalent queue, 3 say, which has 
a prescribed input sequence, 2°) say, and the same output as Q3. Then, 
by asecond application of the lemma, the queues Q3, Q, Qs and Qg may 
be replaced by a single equivalent queue, Q¢ say, which has a prescribed 
input sequence, 2 say, and the same output as Qs. Thus the rooted tree 
network of Fig. 3 may be replaced by a single queue with the same output 
and prescribed input. In the next section we consider the repeated ap- 
plication of the lemma to several queues in tandem. 


Il. TANDEM QUEUES 


We now consider J discrete-time single server queues, with unit service 
times, in tandem, as depicted in Fig. 4. The output of buffer 7 enters 
bufferi +1,fori =1,...,7—1.Letd®,i=1,..., I,denote the content 
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Fig. 3—Example of a concentrating rooted tree network of queues. 


of buffer i at time n, and let z\ denote the corresponding number of 
packets entering the buffer from a source in the time interval (n,n + 1]. 
For convenience, we define d = 0. Then the content of buffer i at time 
n + Lis given by the equation 


dy = d® — U(d®) + Ulas-) +29, (9) 
forn =0,1,2,...,andi =1,..., J. Itisassumed that the initial contents 


d®, as well as the inputs z”, are nonnegative integers. 
Let 


6g ede. FS had =A, (10) 
and 
eM=d n=0,1,2,.... (11) 
Moreover, define 
2 ip oi eee 
p= Ye . (12) 
dim 4+ yo 2,, n=0,...,6-2, 
k=i-n 


fort =1,...,J, and let 
eft D = eb + gH) — yO ,, (13) 
forn =1,2,...,andi=1,...,7—1. Then we have the following 
Lemma 2. Subject to (9)-(13), e is a nonnegative integer, and 
U(ew) = U(d)), 


efit, =e — Ue) +0®, 


(14) 
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forn =0,1,2,...,andi =1,...,I. 


Proof: Since d® = 0, it follows by definition, from (9), (11), and (12), that 
the lemma holds for 1 = 1. We proceed by induction on 1. We assume that 
the lemma holds for some i < J, and will show that it holds fori + 1. We 
identify e®, d&9, e@) vy and z@” with b, cn, en, x? and yn, re- 
spectively, for n = 0,1,2,....The induction hypothesis then implies (2), 
with M = 1, and, from (9), 


dy de) = Ua) FUP) 42a, (15) 


and hence (3), with M = 1. Moreover, (10) and (13) imply that (4) holds, 
with M = 1. Hence, from Lemma 1, with M = 1, it follows that et” is 
a nonnegative integer, and 


U(el*) = U(dS*Y), n=0,1,2,.... (16) 
Also, using (5), 
eft) = ef) — U(elth) + ef + 2htD, (17) 
and 
eft D = eG D — Ue D) + pO, 4 24) (18) 
forn =1,2,.... 
But, from (10)-(12), 
ef) + 24D = dW + 2ht) = yftD, (19) 


Also, fori = 2andn =1,...,i-—1, 
vO, +26 aagre |S (2ncicith = One. (20) 
k=i-n+ 
Finally, forn =i,1+1,..., 


ii 
vO), +20V= 2 2) nig, = OE, (21) 


Hence, from (17)—(21), 


ent = en — Ulen*?) + unt”, (22) 
for n = 0,1,2,.... In view of (16), this completes the proof by induc- 


tion. 
From (9) and (14) we have the following 
Corollary. For n = 0,1,2,...,andi=1,...,J-—1, 
et, = el — Ule) + v®, 


23) - 
dtp = a — UAH) + U(e®) + 26. “4 
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Fig. 4—Schematic of several queues in tandem. 
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Thus, by replacing the first i queues in Fig. 4 by a single equivalent 
queue, with the same output as the ith queue, we have reduced the 
problem of several queues in tandem to that of just two in tandem. 

Suppose now that the input processes 2“, i = 1,..., J, aremutually 
independent, and that each is independently and identically distributed 
(i.i.d.), with 


E(s?@) = 6;(s), i=1,...,Z. (24) 
Then, from (12), 


E(s°®) = [I das), n=i-1Lji,..., (25) 
k=1 


and the input processes v) and z+ are mutually independent, and each 
is iid. The problem of two queues in tandem was investigated recently,? 
and the results are applicable to (23). The generating function of the 
steady state distribution of the contents of the two buffers was calcu- 
lated, under the assumption that the mean combined input rate from 
the two sources is less than unity. Accordingly, we assume that 


I 
> EGP) <1. (26) 
i=1 
Then we may use (23) to calculate the generating function of the steady 
state distribution of the content of each buffer in Fig. 4. The initial values 
vw... vo, fori = 2, do not affect the steady state distributions. 

A particular example was considered,? in which the input to the first 
queue is geometrically distributed, while the input from the source into 
the second queue is either 0 or 1, with fixed probabilities. The steady 
state probability that the content of the second buffer exceeds m was 
calculated, and asymptotic results were derived for m > 1. It would be 
of interest to carry out an analogous derivation for the case of Poisson 
inputs to both queues. The results would be applicable to the case of 
Poisson inputs into J queues in tandem, corresponding to ¢;(s) = 
exp[A;(s — 1)] in (24). Then, from (25), the input process v{ is also 
Poisson, forn =i — 1,i,..., with parameter D4,-; Ap. 

Formulas were derived? for the average waiting times in two queues 
in tandem, under the assumption that all arrivals take place at the end 
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of a unit time interval. The average waiting time in the second queue was 
taken over all arrivals to that queue, both from the source and from the 
first queue. The results may be applied to (23), to obtain the average 
waiting times in each of the J queues in Fig. 4. The averages are over all 
arrivals to each queue. 
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Pulse Dispersion Properties of Fibers 
with Various Material Constituents 


By L. G. COHEN, F. V. DiIMARCELLO, J. W. FLEMING, 
W. G. FRENCH, J. R. SIMPSON, and E. WEISZMANN 


(Manuscript received November 11, 1977) 


Intermodal dispersion properties are compared for high silica fibers 
with borosilicate (B203-SiO2) and germania borosilicate (GeOo- 
B203-SiO2) graded-index profiles. Pulse transmission measurements 
were systematically correlated with profile shapes so that new fibers 
could be fabricated with closer-to-optimal profile gradients at a 
wavelength of 907.5 nanometers. Germania borosilicate fibers with 
power law profile exponents (a ~ 2.03) lowered intermodal dispersion 
50 times from the result expected for comparable step-index fibers with 
N.A. ~ 0.19. By contrast, borosilicate fibers with a ~ 1.78 caused a 
100-fold pulse width reduction in fibers with N.A. ~ 0.14, corresponding 
to a 20 = 0.13 ns/km pulse-broadening rate. 


I. INTRODUCTION 


Dispersive refractive index differences between material constituents 
(GeOo, BoOz, and SiO») for germania borosilicate core fibers cause modal 
group velocity differences to depend on the source wavelength (profile 
dispersion). Therefore, nonparabolic profile gradients are generally re- 
quired to minimize pulse dispersion. 

Previous time-domain transmission measurements!” were used to 
direct the fabrication of a nearly optimal borosilicate fiber. This paper 
compares more recent and extensive data for germania borosilicate fibers 
and borosilicate fibers at \ = 907.5 nm wavelength corresponding to a 
GaAs injection laser. We have also characterized small profile undula- 
tions about a nearly optimal power law shape which degrade intermodal 
dispersion characteristics. Another paper? describes how profile dis- 
persion influences fiber bandwidth properties at other wavelengths. 
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Fig. 1—Refractive-index profiles derived from interference micrographs: (a) a boro- 
silicate graded-core fiber; (b) a germania borosilicate core fiber. The fitted curves are de- 
termined from a weighted [eq. (1)] nonlinear least-squares fit to the data ®. 


ll. FIBER FABRICATION AND PROFILE EVALUATION 


The borosilicate fibers in this study have a uniform 1 BoOs3-6 SiO» 
cladding composition and a core in which the B2O3 concentration de- 
creases from 14 mole percent to 0 percent at the center.4 Germania bo- 
rosilicate fibers have a uniform SiO, cladding, a thin (2 um) 1 ByOs3- 
9 SiOs barrier layer and a core in which the GeO» concentration increases 
from 0 percent to 8.5 mole percent at the center where the material 
composition is 2 GeO -1 B2O3—21 SiO». The modified chemical vapor 
deposition process® is used to deposit the appropriate glass compositions 
by the reaction of BCls, GeCl4, and SiCl, with oxygen at a temperature 
of 1400-1700°C inside a fused quartz substrate tube. In the case of the 
borosilicate fibers, the borosilicate cladding is first deposited followed 
by the graded borosilicate core. The germania borosilicate fibers are 
typically prepared by depositing 2 borosilicate barrier layers followed 
by 40-50 graded core layers. The fused quartz support tube is the clad- 
ding in this case. The radial index profile is graded by a programmed 
variation of the chloride dopant concentrations in the reaction stream. 
After the composite substrate tube is collapsed into a solid preform 
structure and then drawn into fiber, the resultant index profiles are 
determined by interference microscopy of thin fiber cross sections.® 7 

Figure 1b shows a profile for a typical germania borosilicate fiber. The 
dip in the center is caused by GeO2 dopant burn-off during the collapse 
stage. This type of distortion does not appear in boron graded fibers (Fig. 
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la) because the dopant concentration is very small at the core center. 
The dip at either edge of the germania borosilicate profile corresponds 
to the borosilicate layer at the core-cladding interface. The index profile 
parameter, a, is determined by fitting power law profiles to the measured 
curve. The least-mean-square fit profile is determined from weighted 
differences between the measured and optimal curves according to: 





1/2 
Weighted % dev. = E J (AN meas — AN power law)” Wir a 
=1 j 
100 
x 1 
AN max 
where 
Wr;) =r? (1 — r7)3/? 
(2) 


AN wawet law = IN max (1 -r?) ‘ 


The AN’s are index differences and r; is the distance from the core center 
divided by the core radius. The weighting function W(r,) [eq. (2)] was 
determined® by computing the approximate number of modes that are 
confined to various sections of the profile. This reduces the importance 
of the core center, r; = 0, since only the lowest-order modes are confined 
there and of the profile tail, 7; = +1, which is only important to high- 
order modes. Maximum weight is placed on the radial region midway 
between the core and cladding. After obtaining a best fit a, using the 
weighting function W(r), we determine the unweighted deviations of 
the data from the calculated profile. Typical unweighted deviations range 
frorn 1 to 3 percent from the least-mean-square fit power law profile. 
However, recent theoretical calculations? have shown that 1 percent 
profile deviations can cause order-of-magnitude pulsewidth increases 
from the optimal 2c (min). 

Profile distortions can also be illustrated through scanning electron 
photomicrographs of etched fiber cross sections as in Fig. 2. The ridged 
structure midway between core and cladding is in the vicinity of the 
maximum gradient slope. This ridged distortion is observed in both 
boron and germania doped fibers, but it appears more prominent in the 
germania doped fibers due to the high doping in the center where the 
layers are farther apart. These distortions are due to differences in vol- 
atility of the glass components which cause concentration variations in 
each layer deposited by MCVD. 

The concentration of core dopants may be measured directly by the 
use of electron-beam x-ray microanalysis techniques.!° An ETEC scan- 
ning electron microscope equipped with a KEVEX energy dispersive x-ray 
spectrometer has been used to measure the germania concentration in 
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(a) B,03 DOPED FIBERS 





(b) GeO, DOPED FIBERS 


Fig. 2—Scanning electron photomicrographs of etched fiber cross sections illustrate 
dopant concentration profiles for a boron graded profile and a germania graded profile. 


germania borosilicate core fibers as shown in Fig. 3. The high resolution 
of this technique generates data which more accurately represent the 
central germania depletion region than the index profile data (Fig. 1b) 
measured using thin section interference microscopy. The ridged 
structure seen in the germania borosilicate fiber is coincident with re- 
gions of GeO» concentration fluctuation. The germania concentration 
profile characterizes a fiber core in the same way as the refractive index 
profile, provided the effect of the boron is either independent of radius 
or negligible. Fitting the weighted power law function to the concen- 
tration profile shown in Fig. 3, one obtains an a of 2.12. 
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Fig. 3—Germania composition profile for a typical germania borosilicate fiber (the re- 
fractive index profile for this same fiber is shown in Fig. 1b). 


lll. PULSE TRANSMISSION MEASUREMENTS ATX = 907.5 nm 


Pulse dispersion is characterized at \ = 907.5 nm by injecting impulses 
of light (3 dB width = 0.3 nsec, full rms pulsewidth 20 = 0.4 nsec) from 
a GaAs laser and measuring the broadened fiber output pulsewidth. The 
optical shuttle pulse technique?! is used to make length-dependent 
pulsewidth measurements by reflecting propagating light back and forth 
between partially transparent mirrors at the ends of a fiber. Mode- 
mixing effects were relatively small in all the tested fibers since 
pulsewidths increased with an almost linear dependence for multi- 
kilometer path lengths. Therefore, output pulse broadening was pri- 
marily caused by intermodal dispersion, profile dispersion, and material 
dispersion effects due to relative time delays between the source spectral 
components within its 2.5—-3 nm bandwidth. 

Material dispersion effects were reduced by a narrowband interference 
filter which has a 1.4 nm bandwidth approximately centered about the 
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Fig. 4—Material dispersion effects on output pulsewidths from borosilicate fibers (a) 
and (b). The bottom row of photographs represents pulse propagation of unfiltered GaAs 
Jaser light, and the top row represents the propagation of spectrally filtered laser light with 
bandwidth Ad ~ 1.4 nm centered about Ao ~ 907.5 nm. 


laser line peak at \ = 907.5 nm. The filtered source spectral band- 
width!2.!3 should cause 2c = 0.09 nsec/km full rms width pulse spreading 
in borosilicate fibers and 0.11 nsec/km pulse spreading in germania 
borosilicate fibers. These kinds of effects are clearly illustrated in Fig. 
4 for shuttle pulse extrapolated lengths of 9.27 km for fiber (a) (Z = 1.03 
km) and 7.56 km for fiber (b) (LZ = 1.08 km). The bottom row of photo- 
graphs shows output pulses due to unfiltered laser light, and the top row 
of photographs shows pulses due to spectrally filtered light. Results for 
fiber (a) show that when the source spectral bandwidth is cut in half, the 
output pulsewidth is reduced by 20 percent. Pulse outputs from fiber 
(b) show how material dispersion effects mask intermodal effects by 
smoothing the impulse response. A multipeak pulse structure is recov- 
ered by narrowing the laser linewidth. 

Far-field spatial filters? (circles and annular rings) are used to measure 
time-of-flight differences between high- and low-order modes arriving 
at the fiber output. If high-order modes arrive before low-order modes, 
the fiber profile is overcompensated because a < a(opt). When high- 
order modes arrive last, the profile is undercompensated because a > 
a(opt). Output pulses from fibers with nearly optimal profiles are not 
altered in shape by spatial ray filters or changed launch conditions. This 
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Fig. 5—Root-mean-square pulsewidth reduction factor a(step)/a(graded) = 
[ANL/ (12)!/2c] 1/o(graded) is plotted vs. a for germania borosilicate and borosilicate fibers. 
Data points (O for GBS fibers; « for BS fibers) weré obtained by deconvolving expected 
material dispersion effects (0.12 ns/km for GBS fibers, 0.09 ns/km for BS fibers) from the 
measured fiber output pulsewidths. 


type of profile diagnosis has proved to be a very useful guide for fabri- 
cating new fibers with close-to-optimal profile gradients. 

Figure 5 summarizes our pulse dispersion data with a plot of the rms 
pulsewidth reduction factor, o(step)/o(graded), relative to comparable 
step-index fibers, as a function of a for fibers with different profile 
gradients. Pulsewidth measurement precision is 27 < 0.07 ns/km because 
the optical shuttle pulse technique is used to extrapolate 1 km fiber 
sample lengths by an order of magnitude. A germania avalanche diode 
with a response time (20) of 0.65 nsec was used as the detector, and all 
such system broadening was deconvolved from the measured pulse- 
widths. Data points ¢ were obtained from borosilicate fibers with graded 
BoO3-SiO» cores and uniform By03-SiO» claddings. The three peak data 
points, which correspond to pulse dispersions of 0.13, 0.14, and 0.15 
nsec/km, are results which show that nearly optimal borosilicate fibers 
can be repeatably fabricated with 20 < 0.2 ns/km. The optimal profile 
at A = 907.5 nm is characterized by a(opt) ~ 1.78, and the minimum 
measured pulse dispersion, 20 = 0.13 ns/km, represents a 100-fold re- 
duction from the theoretical rms pulse spreading in a step-index fiber 
with AN ~ 0.0067 core-to-cladding index difference. 

Data points O were obtained from germania borosilicate fibers with 
a graded GeO» concentration and a nearly uniform B2O3 concentration 
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Fig. 6—A plot of the rms deviation, of AN vs. r data, from a power law profile vs. pulse 
dispersion, 2c, for fibers with near-optimum a values. 


across the core. Small amounts of B.O3 are added to reduce the viscosity 
of the core glass, thereby aiding the elimination of bubbles. The two peak 
O data points occur when /a ~ 2.03 and correspond to 20 = 0.4 ns/km and 
0.48 ns/km dispersion values. They represent a 50- and 48-fold reduction 
from the theoretical rms pulse broadening in equivalent step-index fibers 
with AN ~ 0.018. The optimal a values for each type of fiber are con- 
sistent with the values predicted by Fleming!? on the basis of refractive 
index measurements. 

The theoretical minimum pulse broadening in an epee graded- 
index fiber is characterized by:!4 


2o(min) ~ 141(AN)? (3) 
The theoretical maximum pulsewidth reduction factor is given by: 


( o(step) ) 1 ANL/c 14 


o(graded) ~ V2 o(min)L : AN (4) 


which is inversely proportional to AN, the maximum core-to-cladding 
index difference. Therefore, the ratio of 2 between the peak pulsewidth 
reduction factors for the borosilicate and germania borosilicate fibers 
in Fig. 5 is consistent with the fact that AN(GBS) ~ 2AN(BS). However, 
the measured pulsewidths are 18 to 30 times greater than the minimum 
values predicted by eq. (3) [2c(min) ~ 0.007 ns/km for borosilicate fibers 
with AN ~ 0.007; 2o(min) ~ 0.024 ns/km for germania borosilicate fibers 
with AN ~ 0.013]. 
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Two possible reasons for the measured dispersion being so much larger 
than the theoretical minimum value are (1) the profile a value varies 
along the length of the fiber, or (ii) the profile is not exactly a power law 
shape. Each of these differences probably applies to some degree. We 
have attempted to correlate the rms deviations from a power law profile 
with the pulse dispersion for fibers which have nearly optimal a values.9 
The influence of the refractive index dip in the germania-doped fiber 
profile was too large for us to obtain any meaningful comparison. In the 
case of the borosilicate fibers, which have a smoother profile, a rough 
correlation can be found. Figure 6 demonstrates this possible correlation 
for fibers with a = 1.8 + 0.05. These data are far from conclusive, but it 
is clear that further significant pulsewidth reduction will require im- 
proved control of the refractive index profile so that it conforms to the 
correct power law function more exactly than the present fibers. De- 
velopment of these improvements will require substantial improvements 
in MCVD deposition control and in fiber profile measurement. 


IV. CONCLUSIONS 


Pulse transmission properties have been compared for two of the most 
common types of high silica graded-index fibers. A spectral filter was 
used to reduce material dispersion effects caused by a GaAs injection 
laser at \ = 907.5 nm and the optical shuttle pulse technique was used 
to make precise intermodal pulse dispersion measurements. 

Nearly optimal profile gradients are characterized by a ~ 2.03 in 
germania borosilicate fibers and by a ~ 1.78 in borosilicate fibers. Re- 
sultant pulsewidth reduction factors are approximately 50 for germania 
graded fibers with AN ~ 0.013 (N.A. ~ 0.19) and 100 for boron graded 
fibers with AN ~ 0.0067 (N.A. ~ 0.14). The factor-of-2 ratio between the 
peak pulsewidth reduction factors for the two kinds of fiber is consistent 
with the fact that AN(GBS) + 2AN(BS). However, the measured pulse- 
widths are 18 to 30 times greater than the minimum values predicted 
by 2o(min) ~ 300A? ns/km. Further significant pulsewidth reductions 
will require improved profile control to reduce by an order of magnitude 
current 1-3 percent rms deviations between fabricated profiles and 
optimum power law shapes. 

The results in this paper apply at a design wavelength, \ = 907.5 nm, 
corresponding to a GaAs injection laser. A companion paper? describes 
how profile dispersion affects fiber transmission bandwidths at other 
wavelengths. 
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The mechanisms of interference at voice-band frequencies from a 
power distribution system which adversely affect the telephone loop 
plant are systematically described. A unified derivation is presented 
of simple lumped-element circuit models for telephone plant coupling, 
shielding, and longitudinal-to-metallic conversion. This approach 
establishes both qualitative understanding and quantitative analytical 
tools for characterizing the effects of low frequency interference. A 
glossary is included which represents a consensus evaluation of the best 
contemporary relationship between historical terminology and modern 
analytical viewpoints. 


I. INTRODUCTION 


This paper explores a systematic approach for understanding the 
electromagnetic interaction between power and -telephone systems. 
Historically, the various mechanisms of coupling, shielding, and longi- 
tudinal-to-metallic conversion have evolved into separately addressed 
concerns. This has led to useful insight but somewhat narrow under- 
standing, since the interdependence of the various concepts has received 
limited consideration. Moreover, since some of the classical treatments 
extend back over five decades, they are sometimes difficult to read owing 
to variations in terminology and basic units. We wish to provide a co- 
hesive overview that emphasizes the interrelationship among these topics 
within a modern analytical setting. This approach, using concepts fa- 
miliar to recent engineering graduates, unifies historical developments 
and provides a basis for understanding current viewpoints toward re- 
ducing power-telephone interaction. 

Although transmission line theory is briefly touched upon as a starting 
point, the basic framework consists of an analytical model that utilizes 
only lumped-element circuit theory. This lumped-element general an- 
alytical model serves the following purposes. First, it ties together within 
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one framework a description of the various physical mechanisms that 
have previously been treated separately. Second, it readily lends itself 
to systematic computer evaluation of the electromagnetic interaction 
between these mechanisms. Finally, from this general model is derived 
several specialized circuit representations of a sufficiently simple nature 
to furnish maximum physical insight. These specialized circuits highlight 
the specific and well-known physical mechanisms of inductive, capaci- 
tive, and dissipative coupling, inductive shielding, and longitudinal- 
to-metallic conversion. A practioner may utilize the more comprehensive 
analytical model, or he may wish to adopt the specialized circuit repre- 
sentations as his basis for understanding, depending on the extent of his 
concerns. The general analytical model retains its conceptual value as 
the origin of a single unified approach since it assures that the specialized 
models provide a consistent description. The usefulness of the various 
models is indicated in the summary section. 


Il. GENERAL ANALYTICAL MODEL 


Although electromagnetic theory forms a fundamental core from 
which all macroscopic electrical behavior can be derived, it would be a 
rather remote starting point for the purposes of this discussion. On the 
other hand, the overall generality of our useful circuit models might well 
go unappreciated or else be questioned in the absence of a clear under- 
standing of their origin. To strike a balance, this paper will adopt 
transmission line theory as a starting point, then move quickly to more 
familiar lumped-element circuit equations and models. 

The multiconductor transmission line theory is derivable from 
Maxwell’s equations with remarkably few restrictions,! although in 
common textbooks generality is often swapped for expediency. Many 
intricate details involving specialized electromagnetic analysis can be 
succinctly summarized as transmission line parameters. The presence 
of all skin effect phenomena,” both within the metal conductors and more 
importantly the resistive earth, can be rigorously accounted for by the 
transmission line parameters. Since the circuit theory equations and 
associated models are evolved from transmission line theory, they too 
can account for all skin effect phenomena. Such phenomena manifest 
themselves in the form of circuit parameters that are no longer fre- 
quency-independent; i.e., the R, L, G, and C can take on frequency 
dependencies in accordance with rigorous solutions to electromagnetic 
boundary value problems. In this way, full advantage is taken of the 
_ relative simplicity and usefulness of lumped-element circuit theory while, 
at the same time, maintaining substantial generality. 


2.1 Transmission line synopsis 


A rather concise physical interpretation is stated here to aid in the 
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Fig. 1—Induction by electric and magnetic fields from disturbing into disturbed con- 
ductor. 


understanding of transmission line equations and to instill confidence 
in their usefulness. While an understanding of these differential equa- 
tions is highly desirable, their physical content carries over to the simpler 
algebraic equations to be introduced in the next section. 

Consider two conductors located at fixed heights in relation to the 
surface of a resistive medium as illustrated in Fig. 1. Electromagnetic 
induction or coupling among parallel conductors may be classified as 
either transverse or longitudinal with respect to the conductor axes. 
Transverse coupling characterizes an electric force acting at right angles 
to the conductor and medium. This perpendicular force arises from an 
excess of charge that is proportional to conductor potential, V. Since the 
force acts to drain off and thereby deplete conductor current in the 
amount of —dJ within an incremental distance of dz, this coupling 
mechanism is described quantitatively by 


de YV. (1) 
The proportionality factor Y is called an incremental transverse ad- 
mittance (mhos/meter). Longitudinal coupling, on the other hand, 
characterizes an electric force in the direction of the conductor axes. This 
axial force arises from the movement of charge that is proportional to 
conductor current, J. Since the axial electric force tends to decrease 
conductor potential with an increasing z, this coupling mechanism is 


described by 


a ZI. (2) 
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The proportionality factor Z is called an incremental longitudinal im- 
pedance (ohms/meter). 

Equations (1) and (2) may be recognized as the simple transmission 
line equations found in basic textbooks.? Such treatments often define 
V as the voltage difference between conductors 1 and 2 and take the I 
of conductor 1 to return entirely through conductor 2. This convention 
is appropriate when the two conductors are energized to form just one 
(metallic) circuit. More generally, a second (longitudinal) circuit is able 
to coexist on the two conductors. All possible circuits can be systemati- 
cally taken into account by using the following reference convention. The 
voltage on each separate conductor is defined with respect to a common 
reference, in this case the finitely conducting earth which forms a “‘re- 
mote ground,” as implied by the electric field lines in Fig. 1. Moreover, 
the current in each conductor is defined as having total “earth return,” 
quite irrespective of the circuit’s actual completion path. The relation- 
ship between this systematic “earth-return” reference convention and 
the useful longitudinal and metallic convention is more fully explored 
in Appendix B. 

Equations (1) and (2) apply implicitly to multiconductor circuits. In 
this case V and J are taken as column vectors whose components asso- 
ciate with each individual conductor, while Y and Z are taken as square 
symmetric matrices. The off-diagonal matrix elements represent “mu- 
tual coupling” effects, whereas the “‘self-reaction” of each conductor is 
characterized by the diagonal matrix elements. To illustrate this point, 
observe the matrix differential equations that characterize the two 
conductors in Fig. 1. The explicit matrix form of eq. (1) is 


-2U)-Ee Si} 
dz Io Yo Yoo Vo , 


ata ae =| Fat (4) 
dz Vo ZL 21 Zz 22 I 2 

Each matrix equation is simply a compact notational expedient for 
representing a system of M coupled individual equations, where M (two 


in this case) is the number of conductors in the multiconductor trans- 
mission line. For instance, expanding this last matrix equation yields 


for eq. (2) it is 


dV 
—~— = Zh + Zyle 
dz 
dv. 
— 2 = Zoi, + Zools, (5) 
dz 


where Zj. (equal to Z2; from reciprocity) represents the longitudinal 
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mutual impedance/unit length which couples conductors 1 and 2. Matrix 
notation is exceedingly useful in subsequent developments and well 
worthy of the effort required to acquire familiarity. With this notation 
it becomes straightforward to systematically account for the interaction 
of neutral and multiple phase wires of the power system with the strand, 
sheath, and metallic circuit twisted pair conductors of the telephone 
system. 


2.2 Segment model 


A fortunate simplification arises owing to the telephone loop plant 
not being “long” when measured in relation to the wavelength of voice- 
band interference frequencies. A typical loop can be subdivided into a 
minimal number of electrically short segments, each of which is char- 
acterized by a fixed geometrical configuration. This allows the trans- 
mission line differential eqs. (1) and (2) to be replaced by much simpler 
lumped-element circuit equations. In addition, an equivalent circuit 
representation can be identified that will form the basis for all subse- 
quent analyses. The more important details of this simplification are 
outlined below. 

Consider an exposure segment of power and telephone system con- 
ductors with a uniform geometrical configuration and extending a length, 
Aé, between two locations identified as j andj + 1. The need for AZ to 
be electrically short is generally not restricting at voiceband interference 
frequencies; this point is addressed more quantitatively in Appendix 
C. A general segment consisting of M individual conductors is illustrated 
schematically in Fig. 2a. Some conductors represent the power system 
neutral and phase wires; the remaining conductors can characterize such 
telephone system wires as a support strand, cable sheath, and twisted 
voice-circuit pairs. The nature of coaxial or cable-sheath-enclosed con- 
ductors is totally characterized by the numerical value of individual el- 
ements within the incremental impedance and admittance matrices. 
Moreover, these matrix elements also reflect spacing and height infor- 
mation. Since the conductor configurations may change from one seg- 
ment to another, this variation will be identified by the superscript J, 
j + 1, on the incremental matrices that are applicable between locations 
j andj +1. The column vectors representing voltage and current vari- 
ables will also carry an appropriate superscript. The integer subscripts 
continue to represent specific conductors within matrices and column 
vectors. 

In terms of the matrix notation described above, eq. (2) may be ac- 
curately approximated as 


_ (Vi#1— Vi) 


o = Zistl [i741 (6) 


LOW-FREQUENCY INTERFERENCE 1667 


where J/-/+1 is the current in the center of segment j,j + 1. Similarly, 
using an average value of voltage, eq. (1) becomes 

(1h + 1g") _ ai pe (Vi + Vit} 

Ae 2 

In the above equation, the decrease in longitudinal current, —dI, between 
j and j + 1 has been identified with the transverse currents, I/, and Ij*}, 
which flow after and before the location identified by superscript. (These 
are the so-called charging currents associated with distributed capaci- 
tance of aerial cable, although they might also represent current flow 
due to distributed conductance of direct buried cable.) It is convenient 
to define total impedance and admittance matrices such that all elements 
of the incremental matrices are multiplied by the segment length, A?@: 


(7) 


Ziitls= ZiIt1Ae (8) 
Yi itl = YIIt1Ag, (9) 
Then, upon algebraic rearrangement eq. (6) becomes 
longitudinal 
Vi — Vitl = Zit) Ji3+1 impedance |: (10) 
coupling 


Similarly, the two independent terms on the right side of eq. (7) yield 


Ii, = Yi Vi transverse 
admittance ]» (11a) 
[ftis yptt pitt coupling 
| (1b) 
where 
yi = yj =l, Visitl (12) 


has been defined in this decomposition. The circuit theory eqs. (10) and 
(11) are the desired replacements for the transmission line differential 
eqs. (1) and (2). 

The foregoing basic circuit relationships completely characterize the 
physical coupling mechanisms, both longitudinal and transverse, within 
each segment of a power and telephone exposure. These equations lead 
directly to the equivalent circuit represented in a compact matrix form 
in Fig. 2b. The voltage and current variables are represented by column 
vectors, whose elements correspond to variables shown in Fig. 2a. The 
impedance and admittance elements represent square matrices. Observe 
that half the total segment admittance has been associated with the 
centers of each half-segment, occurring after location j and before 
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Fig. 2—Interaction model for multiconductor segments and general terminal constraint. 
(a) Nomenclature for matrix representation. (b) Equivalent circuit in compact matrix 
form. 


location J + 1, in accordance with eq. (12). The total current which de- 
parts location j and flows into segment j, j + 1 is identifiable as 


I= [4541 + Hi, (13) 


whereas the total current which enters location j + 1 from this segment 
is given by 


Ti+) = Ji,i+1 — fth (14) 


In summary, all pertinent equations are implied by the equivalent circuit 
in Fig. 2b when analyzed with matrix algebra. 


2.3 Constraint characterization 


Now that a multiconductor exposure segment has been completely 
characterized in quantitative analytical terms starting from transmission 
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line concepts, the terminations or boundary conditions imposed at each 
end of a segment must be considered. The variety of terminations en- 
countered in practice is rather substantial. For instance, the power 
system conductors will have load impedances and discrete grounding 
impedances attached at various locations. Generators will feed the power 
line from at least one end and possibly at two or more locations. The 
telephone system shielding conductors may be either independently 
grounded or resistively coupled to the power line ground. This resistive 
coupling may arise from the interaction of closely spaced ground rods 
or from direct bonding to the power system neutral conductor. 

Each of these termination constraints could be implemented on a 
case-by-case basis for those relatively simple configurations that involve 
just a few conductors and one or two segments. Once the boundary 
conditions are characterized with individual circuits, the conglomerate 
network including the segment representation(s) must then be analyzed. 
There is a wide variety of network analysis techniques from which an 
efficient method can be selected. Generally, either mesh or node analysis 
is convenient for most simple networks. 

Rather involved network configurations arise when considering the 
interactive effects of multiple shielding conductors and several cascaded 
exposure segments. For these cases it becomes highly desirable to invoke 
systematic network analysis techniques that lend themselves to 
straightforward computer implementation. It is possible to handle the 
wide variety of segment terminations (assumed linear) in a single ver- 
satile model by utilizing a Norton equivalent circuit to represent the 
general terminal constraint. When cast in matrix notation, these 
boundary conditions take the form: 


general 
he Si Aa terminal ]- (15) 


constraint 


This approach has been illustrated in Fig. 2 by including a Norton 
terminal constraint for an arbitrary location k. With a Norton repre- 
sentation, the ideal short circuit current sources, J*,, simply vanish for 
the special case of passive terminations, while the off-diagonal mutual 
terms in the admittance constraint matrix, Y*, readily account for any 
resistive coupling from nonindependent grounding. Details concerning 
the explicit form of the termination circuit have been suppressed into 
an implicit equivalent characterization. This will serve to systematize 
the subsequent network analysis procedure. The motivation for choosing 
a Norton instead of a Thevenin equivalent circuit will be clarified in the 
following section. 
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(b) 


Fig. 3—Matrix representation of network model for many cascaded segments. (a) 
Simplification at adjoining segments. (b) Composite circuit for N — 1 segments. 


2.4 Cascaded configuration 


Circuit models have been derived for a multiconductor exposure 
segment and for a general terminal constraint in the preceding sections. 
With this building-block approach in mind, these circuits can now be 
combined to form the general network illustrated in Fig. 3. This network 
enables analyzing the effects of many exposure segments connected in 
cascade, while incorporating various grounding and bonding constraints 
at the segment interfaces. Thus, it becomes a fundamental tool from 
which to assess electromagnetic interaction between the power distri- 
bution system and telephone loop plant. This interaction varies with the 
configuration changes that occur along the loop plant. Physical pa- 
rameters amenable to analysis include power-line geometry and sepa- 
ration, size and type of shielding conductors, intervals between grounding 
and bonding points, quality of grounds and bonds, and degradation of 
sheath continuity. The presence of such devices as drainage reactors and 
neutralizing transformers can also be accounted for. Since this network 
model has basic importance for both the derivation of several specialized 
circuits in the next section, as well as computerized algorithms that allow 
detailed parametric studies, a brief description follows. 

A useful simplification is contained within the network model at ad- 
joining exposure segments as indicated in Fig. 3a. Note that the two 
segment admittances, Y/, and Yj, adjacent to location j, can be combined 
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with the constraint admittance, Y/ at location j, by simple matrix ad- 
dition to form a total admittance, Y/. 


yi= Yl+Yi+ Yi. (16) 


With this reduction the total transverse current, I’, which is “bled-off” 
the longitudinal current flow, constitutes the column vector addition 
of currents associated with the individual transverse admittance ma- 
trices. 


Ve V+iH+l. (17) 


A knowledge of the node voltage, V’, is sufficient to reconstruct the in- 
dividual transverse current contributions via multiplication with the 
appropriate admittance matrix, as detailed in eqs. (11) and (15). Hence, 
the network analysis can now focus primarily upon determining V/ and 
Ij.J*1 for each location and segment. This network simplification is a 
direct result of having chosen the Norton equivalent circuit to charac- 
terize a general terminal constraint. 

The foregoing simplification allows the overall network to be repre- 
sented as shown in Fig. 3b. Assumed known in this model are the pa- 
rameters for the Norton termination constraints, as well as the segment 
admittance and impedance matrices. Further discussion of these ma- 
trices can be found in Appendix A. An analytical solution for the network 
model will determine the unknown voltages and currents in terms of the 
remaining known circuit parameters. An algorithm that provides an 
efficient solution can be obtained by analyzing the composite circuit 
using ladder network techniques.° A computer program centered around 
such an algorithm has been developed and utilized extensively to ex- 
amine the parametric dependence of shielding. 


Ill. SPECIALIZED CIRCUIT REPRESENTATIONS 


Particular physical mechanisms associated with just certain con- 
ductors of a larger network will be the focal point of this discussion. The 
influence of the remaining parts of the network upon these chosen 
conductors can usually be characterized by dependent (i.e., controlled) 
sources. These sources contain information about the remaining circuitry 
and succinctly characterize its interaction with the chosen conductors. 
This approach makes it easier to understand the interaction with the 
remaining circuitry resulting from specific physical mechanisms. 
Moreover, it points the way to certain measurements that can be used 
to characterize a complex network. With these objectives in mind, this 
section develops specialized circuit models for coupling, shielding, and 
longitudinal-to-metallic conversion. 

This approach furnishes only specialized characterizations of various 
physical mechanisms; it does not remove the overall system interde- 
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pendence among these mechanisms. Numerical values for the dependent 
sources, which represent the effects of the remaining network, may be 
partly influenced by parameters associated with the few chosen con- 
ductors. This interactive behavior can best be accounted for by an 
analysis of the total network. Such a solution may be simply formulated 
through the use of mutually interacting specialized circuits. These circuit 
representations serve as building blocks to construct the total network, 
and hence, its total interaction. This approach facilitates a direct analysis 
without the need of general matrix algebra formulations described in 
the previous section. 

On the other hand, the interaction between the dependent sources 
and parameters of the chosen conductor(s) may sometimes be weak, or 
of “second-order” effect. In these instances simplifying approximations 
are appropriate, and quite limited measurements may suffice to char- 
acterize specific physical mechanisms. It is difficult, of course, to know 
when such approximations are valid, short of either actually performing 
the complete analysis or collecting extensive measurement data. In 
simplifying approximations, practical experience and/or intuition gained 
from prior analyses should be of value in establishing sound engineering 
judgment. 


3.1 General coupling model 


To simplify the discussion, a single conductor within an exposure 
segment can be selected for examining the longitudinal and transverse 
coupling to the remaining conductors within the segment. Let the chosen 
conductor be labeled i to denote any one of the several conductors 
numbered 1 through M within the segment. Moreover, consider the 
segment that connects arbitrary locations j andj + 1. 

The longitudinal impedance type of coupling within segment j, 7 + 
1 is characterized by matrix eq. (10). In its explicit form, this equation 
reads 


V4 vir Zi, Zin +2 Ly ses Zim |i 
V3 vit Zo, Zong +2 Lo ++ Lom |{le 
vil jv . Zi Zig +2 “Li ee: Lim | {Li te) 
V4 Viti ZmMi1 Zm2 +: Z4mi +*> Zum|{lu 


The superscript notation has been omitted from the elements of Z//+1 
and [/J*1, since only one segment is under discussion. The difference 
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in voltage to remote ground between locations j and j + 1 for the ith 
conductor is given by multiplying the ith row of matrix Z//*+! with col- 
umn vector J//+1, 


vi - yi = Zi], + Ziolo tee? + Zul; +--+ Zinn. (19) 


This equation for longitudinal voltage drop is made up of two kinds of 
terms. 

(i) It is easy to recognize the voltage drop due to the self-impedance 
of conductor 1 as Z;;J;. 

(ii) There are several mutual impedance terms, each of which accounts 
for longitudinal voltage induced in conductor i because of current flowing 
in some other conductor, e.g., k, as Zizlp. 

Hence, rewriting eq. (19) to keep these terms separate gives 


Vi — Vit) = ZiT; + Voi, (20a) 


where 


M 
Vsi = a Zirlr. (20b) 
(hai) 

The summation term above accounts for all effects associated with 
longitudinal coupling of the remaining conductors within the segment. 
When these conductors are relatively close, the mutual impedances, Z;z, 
consist dominantly of positive reactance, and the coupling is referred 
to as inductive. For conductors having large spacing, the Z;, parameters 
are both frequency-independent and dominantly resistive, which con- 
stitutes one form of dissipative coupling. The numerical dependence of 
Zjz upon conductor spacing, frequency, and earth composition is more 
fully described elsewhere.® What is important here is that these Z;, pa- 
rameters quantify both inductive coupling and resistive coupling. 

In a comparable manner, the transverse admittance type of coupling 
within segment j, 7 + 1 is characterized by matrix eqs. (11a) and (11b). 
With the aid of eq. (12), the explicit form of eq. (11a) reads 


Thy Yu Yio sts Yue Yim | {V4 
The Yor Yoo +++ Yor +++ Yom || VS 


: = il : 
i; fe Yn Yio cee Yuoss: Yim | {V} on) 


Thy Yui Yue «:: Yumi -:: Yum} iV\y 
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The superscript notation has again been omitted from the elements of 
Y/j+1, but must be retained for the column vectors J/ and V/ to dis- 
tinguish between eqs. (11a) and (11b). The transverse current, I/,,, which 
flows after location j from the ith conductor, is obtained by multiplying 
the ith row of matrix Y/, with column vector V/: 


Th; = Yo Vin Yi + VioVh +--+ + YuVit--++ YiaViy). (22) 


This equation for transverse current flow is composed of two kinds of 
terms. 

(i) The current flow due to the self-admittance of conductor i for the 
half of the segment closest to location j is identified as (1/2) Y;;V!. 

(ii) Each of several mutual admittance terms accounts for the 
transverse current flow induced in conductor 1 caused by voltage on some 
other conductor, e.g., R, as (1/2) Yjz Ve. 

Rewriting eq. (22) to keep these terms separate gives 


Th; = YeYiVi — Yell; (23a) 
where 
M 
==> Va Vi. (23b) 
(hai) 


The choice of a negative sign with the summation is partially motivated 
by the fact that Y;, is generally negative. A completely analogous de- 
velopment may be pursued starting with eq. (11b). Equation (12) is again 
used to obtain the admittance matrix for the half of the segment oc- 
curring before location j + 1. Separating out the two types of contributors 
to transverse current flow produces 


Hf = Y,Vit — EP, (24a) 
where 
lip=—S Yavy". (24) 
(esei) 


The summation terms of eqs. (23b) and (24b) account for all effects 
associated with transverse coupling of the remaining conductors within 
the segment. When all the conductors are above ground, the mutual 
admittances Y;, consist of appropriately signed susceptance, and the 
coupling is referred to as capacitive. For buried conductors having direct 
contact with the soil, the Y;, parameters are both frequency-independent 
and dominantly conductive, which constitutes a second form of dissip- 
ative coupling. In actual buried installations, some conductors may be 
in direct contact with the soil, while others are insulated with dielectric 
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Fig. 4—Inductive, capacitive, and dissipative coupling model for ith conductor. 


coatings. Hence, the Y;, transverse admittance parameters quantify 
capacitive coupling and conductive coupling, both of which may occur 
simultaneously within a single exposure segment. 

A general coupling model can now be obtained based upon the fore- 
going development. All pertinent coupling mechanisms are contained 
within eqs. (20), (23), and (24), and an equivalent circuit based upon 
these equations will portray all the relevant physical concepts. Such a 
circuit is evolved in the following manner. 

The summation terms appearing in eqs. (20b), (23b), and (24b) may 
be modeled as dependent sources, in accordance with the substitution 
(or compensation) theorem of circuit theory.’ In particular, the induced 
longitudinal voltage term of eq. (20b) is modeled by an ideal dependent 
voltage source. This voltage source is termed ideal because it contains 
zero internal impedance. Moreover, it is dependent because its voltage 
value is determined by currents that exist on the other conductors within 
the segment, precisely in accord with eq. (20b). On the other hand, the 
compensation theorem permits modeling the induced transverse current 
terms of eqs. (23b) and (24b) by ideal dependent current sources. These 
current sources are termed ideal because they contain zero internal ad- 
mittance (i.e., infinite impedance). Moreover, they are dependent be- 
cause their current values are determined by voltages that exist on the 
other conductors within the segment, precisely in accord with eqs. (23b) 
and (24b). 

Having ascribed dependent sources to account for the summation 
terms, eqs. (20a), (23a), and (24a) lead directly to the general coupling 
model shown in Fig. 4. This model applies to each single conductor within 
a segment; i1.e.,1 = 1,---,M. Two conditions must be fulfilled for this 
equivalent circuit to constitute a valid coupling model. 

(¢) The equivalent circuit when subjected to standard circuit analysis 
techniques must yield precisely eqs. (20a), (23a), and (24a), since these 
equations characterize the pertinent coupling mechanisms. 
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(it) The circuit analysis techniques must yield only these equations, 
i.e., no extraneous information may be falsely implied. This second re- 
quirement is quite important if the model is to avoid artifacts suggesting 
physical behavior that is not actually present. 


Careful reflection should reveal that both these conditions have been 
satisfied in the illustrated coupling model. Hence, this equivalent circuit 
can be relied upon to furnish physical insight regarding the detailed 
interactions of all coupling mechanisms. 


3.2 Various shielding models 


With a reliable circuit model for coupling as a foundation, it is easy 
to evolve circuit models that describe various forms of shielding. In a 
rather fundamental way, shielding is nothing more than a wise utilization 
of available coupling mechanisms. These mechanisms are appropriately 
constrained to minimize an undesired signal that could contribute to 
interference. For instance, a low-impedance shunting path (such as a 
ground on an aerial cable sheath) is commonly used as a constraint upon 
capacitive coupling. Such a low-impedance terminal constraint serves 
to decrease the voltage that is supplied to an inherently high-impedance 
capacitive coupling mechanism and thereby renders the capacitive 
coupling mechanism ineffective. This type of shielding requires only one 
low-impedance shunting path and may be correctly termed electric 
shielding. (Historically, the unfortunate misnomer of “electrostatic” 
shielding has been used to describe electric shielding.) A second type of 
shielding requires at least two low-impedance shunting paths. These 
constraints allow a “shielding current” to flow which, in turn, induces 
a “‘shielding voltage” via magnetic induction. Owing to its enormous 
practical importance, this magnetic (or inductive) type of shielding will 
be emphasized in the following circuit representations. — 

The simplest form of inductive shielding is illustrated by the classical 
shielding model shown in Fig. 5. An understanding of the basic shielding 
phenomenon will be evident from a straightforward analysis of this 
simple circuit. However, a derivation of this circuit representation is first 
necessary to make clear its inherent limitations as caused by various 
simplifying assumptions. 

Consider an exposure segment in which attention is focused upon two 
individual conductors: one conductor that is to be shielded and a second 
conductor that will furnish the inductive shielding. The coupling of each 
conductor to all remaining conductors within the segment is obtained 
from the inductive coupling model of Fig. 4. To apply this model to the 
shield (conductor 2), the admittances to ground Yoo/2 are taken to be 
zero, since they are shunted by the (assumed) low-resistance grounding 
terminations R, and Ry. Moreover, the dependent current sources 
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Fig. 5—Classical inductive shielding model. (a) Dependent-source circuit. (b) Complex 
transformer circuit. 


(1/2)I,2 are absent since transverse coupling is assumed negligible. 
Similarly, the coupling model for the shielded wire (conductor 1) also 
has zero admittances. This is in anticipation of a terminating impedance 
to ground that is located external to the segment and is low in relation 
to the impedance of the distributed capacitance of the conductors. By 
so suppressing extraneous detail wherever possible, it is easy to obtain 
a clear representation of the inductive shielding mechanism. In fact, no 
additional complication arises if the shielded wire is actually a twisted 
pair. In this event, the parameters for conductor 1 are simply replaced 
by those in the longitudinal circuit of Fig. 6a, which is developed in the 
next section. 

The dependent voltage source within each coupling model is now 
decomposed. Recall that these voltage sources are determined by eq. 
(20b) with i given the values of 1 and 2. Those contributions to the 
voltage sources which arise from conductors 1 and 2 are first separated 
out as 


M 
Vs1 = Zigle + x, Zirlr (25a) 
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and 
M 
V52 = Zoli + 2 Zorl. (25b) 


The summation terms account for coupling to all remaining conduc- 
tors within the segment. If conductors 1 and 2 are physically near each 
other in comparison to the distance from each remaining current- 
carrying conductor, then Z1, ~ Zor, k = 3,--+, M, as an examination 
of the appropriate mutual impedance equations will verify. In other 
words, the coupling to conductors 1 and 2 from each “outside” conductor 
becomes essentially identical. Under this condition, the summation 
terms in eqs. (25a) and (25b) are practically equal, and their contribution 
to the induced voltage within each conductor is the primary voltage, 
Ve: 


M M 
DD Ziurdly ~ VP ~ YS Zorle. (26) 
k=3 k=3 


Although primary voltage is dependent upon the currents, J;,, (k = 3,-+-, 
M) which flow elsewhere within the segment, these currents are usually 
assumed to be unaffected by the presence of the shield, conductor 2. 
Hence, the V? appearing in conductors 1 and 2 of Fig. 5 is often taken 
to be independent, or fixed, just as with a true applied source. 

The remaining term in both eqs. (25a) and (25b) accounts for inter- 
action between the shielding and shielded conductor. Consider the first 
term in eq. (25a) which is the voltage induced into the shielded conductor 
as a result of current flow in the shielding circuit formed by conductor 
2. Since I. can be substantially 180 degrees out of phase from its assumed 
reference direction, this first term is rewritten as 


Zyol2 = —[Z12(—L2)] (27a) 
=-YV5s, (27b) 


where the expression inside brackets is the shielding voltage, V*. 
Shielding voltage as defined will often be of similar phase angle to V? 
(owing to the negative sign within the brackets), and appears in the 
circuit representation of Fig. 5a as an opposing voltage source (owing 
to the second negative sign outside the brackets). It will be convenient 
to view the difference between the primary and shielding voltage sources 
as a remnant voltage, V”. Hence, 


yr=Vp—Vs (28) 


is a useful measure of the shielding mechanism’s effectiveness since, by 
eqs. (25a), (26), and (27), V” is simply the total induced longitudinal 
voltage in the presence of shielding. This induced voltage is the driving 
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mechanism for longitudinal current flow in the shielded circuit and, 
consequently, for the accompanying longitudinal-to-metallic conversion 
process. The longitudinal current flow is usually small, however, in 
comparison to that in the shielding circuit, since the shielded circuit 
termination impedances are generally large compared to Rg and Rp. 
Hence, it is assumed that J, = 0 for purposes of the classical shielding 
model. This idealization allows the first term in eq. (25b) to be discarded, 
although its would-be presence is indicated in Fig. 5a by a “dashed-in” 
dependent voltage source. 

By way of summarizing the physical mechanisms contained in the 
dependent source circuit of Fig. 5a, asimple calculation of shielding ef- 
fectiveness will be carried out utilizing this classical shielding model. A 
measure of shielding effectiveness is the shield factor, 7, defined as 
remnant voltage normalized to the exciting primary voltage 


ai 
n= (29a) 
or 
ys 
n7=1- Vp ; (29b) 


where the last relation follows from eq. (28). The shielding voltage de- 
pends upon longitudinal mutual impedance and shielding current as 


V5 = Zy0(—I2). (30) 


The latter quantity obtains from the shielding conductor loop equa- 
tion 


Vp 
Be ame Zn thy ree (31a) 

where 
Rr=R,t Rp. (81b) 


The amplitude and phase of the shielding current (and thereby the 
shielding voltage) is fundamentally dependent upon both longitudinal 
self-impedance and total termination resistance. Utilizing the above 
relations with the definition of eq. (29) yields the shield factor as 


__ 412 

Lo +t Rr 
Thus, the circuit representation leads quite directly to a simple ex- 
pression for shielding effectiveness. 


It is sometimes enlightening to visualize the inductive shielding 
mechanism in the context of a single-turn transformer. The circuit 


n=1 (32) 
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formed by the shielding conductor and its earth-return path is viewed 
as one side of a transformer. The shielded conductor is viewed as an 
open-circuited secondary winding. A primary field is presumed to excite 
both windings, owing to its associated magnetic flux cutting both circuits 
equally. The resultant current flow in the primary winding then couples 
a shielding voltage into the secondary winding as a result of mutual in- 
ductance between the windings. These notions have been highlighted 
in the alternate shielding model shown in Fig. 5b. 

The validity of the circuit representation in Fig. 5b rests upon its direct 
equivalence to that of Fig. 5a. Basically, the longitudinal coupling as 
characterized by dependent sources in Fig. 5a has been alternatively 
characterized as a complex-valued mutual inductance, with an appro- 
priate transformer dot convention, in Fig. 5b. A conceptual generalization 
is required here in that the new mutual inductance must now represent 
energy dissipation as well as energy storage. That is, both the resistive 
and reactive parts of longitudinal mutual impedance are to be charac- 
terized by a mutual inductance term, 


Z12 = (Rig + Joly) = jamie. (33) 


Hence, this new mutual term must assume a complex value given by 
| 1 
Mio = (a2 + = Ruz). (34) 
Jw 


It turns out for close conductor spacings that M2 is dominantly real- 
valued (about 90 percent) as given via Lyo, and the remaining imaginary 
term, —jR12/w, characterizes dissipative coupling associated with the 
earth’s nonzero resistivity. Although for most practical conductor 
spacings the coupling mechanism itself remains dominantly inductive, 
the phase relationship between V* and V?, and thereby the shielding, 
can be significantly affected by R19. 


3.3 Longitudinal-to-metallic conversion model 


The general analytical model and all subsequent models described 
so far have been developed with the earth-return reference convention 
for voltage and current variables. This choice of reference convention 
was motivated primarily from the standpoint of consistency, such that 
various segments and termination constraints could be systematically 
cascaded to facilitate easy computer evaluation. Sometimes physical 
insight into a particular aspect of the overall problem can best be en- 
hanced by adapting voltage and current variables which have a reference 
convention relating more directly to actual operating conditions. For 
instance, the effects of unbalanced operation in telephone circuits (which . 
are intended to operate in basically a balanced circuit mode) are most 
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easily understood when analyzed as a so-called longitudinal and metallic 
circuit. 

The relationship between the earth-return reference convention and 
the longitudinal and metallic reference convention is covered in detail 
in Appendix B. This relationship constitutes a change in variables, or 
a transformation, when stated in mathematical terms. This change 
further requires an appropriate modification of the previous circuit 
relations. The transformed equations, also detailed in the appendix, 
basically relate impedance and admittance quantities as used in a 
longitudinal or metallic circuit back to those appropriate to an earth- 
return circuit. The nomenclature and definitions introduced in Appendix 
B will be utilized in the following development. 

To focus attention upon the electrical behavior of conductors 1 and 
2 utilizing longitudinal and metallic reference conventions, the trans- 
formed circuit relations will be mathematically “partitioned.” As in the 
specialized models already developed, one portion of the matrix equa- 
tions remains unaltered, and the influence of other portions are lumped 
together into a single dependent term. This allows specialized circuit 
models for the first two conductors to be derived, in which the effects 
of all other conductors 3 through M are characterized as dependent 
voltage and current sources. From eq. (48) in Appendix B, the longitu- 
dinal interaction involving conductors 1 and 2 is given by 


d vii Zm YAZ]TIm Vin 
Lvel- (el -[eaezellel+bz) 
Vi Vb AZ LZeltle Voe 
Here, the voltage contribution due to other conductors within the seg- 
ment is denoted as 


M 
Vom = & (Z1k > Zor Lp (36a) 
=3 


for the metallic component, whereas for the Jongitudinal component, 


M 
Voe= x Th(Zir + Zor)lr. (86b) 


Hence, excitation of the longitudinal mode depends upon the average 
longitudinal mutual impedance between the pair conductors (1 and 2), 
and each of the others (k = 3, ---, M). On the other hand, excitation of 
the metallic mode depends upon the difference in mutual impedances, 
as based upon the earth-return reference convention. This difference 
is typically minimized by twisting the pair conductors, causing them to 
effectively occupy the same position in relation to the other conductors. 
The transverse interaction, which corresponds to eq. (11), follows using 
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eq. (49) as 


Tim] 10 Ym AY] [Va] 1f Em 
ae Nie A Deca (ea IR, (37a) 
Thel 2LY%AY Yel LV? 2LTbe 
The transverse current flow due to other conductors within the segment 
is given by 


: M . 
Liga 2, To(Yir — Yor) Vi (38a) 
and 
. M ; 
Ue=- x (Yur + Yor) Vi. (38b) 


As in eq. (36), excitation of the longitudinal mode depends upon a sum 
of transverse mutual admittances, whereas excitation of the metallic 
mode depends upon a difference of terms characterizing coupling to 
other conductors. The seemingly illogical appearance or omission of a 
1/2 multiplicative factor is just a peculiarity of the longitudinal and 
metallic reference convention stated in eq. (43), Appendix B. A second 
eq., (37b) (not given), is obtained by replacing j with j + 1 and the sub- 
script a with b. All the physical phenomena between locations j and J 
+ 1 is encompassed within eqs. (35) and (37). If V3,---, Vay and 13, ---, 
Iy are assumed known, either by previous analytical solution or by direct 
measurement, the above equations yield the response for conductors 1 
and 2. 

Physical insight is gained by constructing equivalent circuit models 
whose behavior is controlled by eqs. (35) and (87). It is convenient to 
derive two specialized circuit models: one to characterize the metallic 
variables and another for the associated longitudinal variables. Conso- 
lidating similar variables from eqs. (35) and (37) by simple rearrange- 
ment yields the pairs 


Vi, ~ Vit! = Zinlm + Vem 
Thm = We¥mVin — Voli 
Ibi = YeYmVit* — Yori (39) 
and 
Vi — Vit} = Zele + Vee 
The = YoY eVh — “oli, 
Tf! = YoY e Vit! — YoIiG}, (40) 


where the new subscript s denotes source. Two terms are contained in 
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each dependent voltage source: 


Vian = hAZI ¢ + A es 
M 
= %4(Z11 — Zoo)le + x, (Zip — Zor)Ip (41a) 


and 


Vse = WAZIn + Voe 
M 
= 1(Z11 — Zo2)Im + x, Yo(Zip + Zor). (41b) 


The first term in each source represents mode coupling due to impedance 
unbalance and the second term represents the coupling to other con- 
ductors as defined in eq. (36). Similarly, the dependent current sources 
become 


i = y bAYV) + Eon 


= —l(Y 1, — Yoo) V4 —- 3 Yo( Yin — Yor) Vi (42a) 


and 


Tig = —%AYVi, + Le 
: M ; 
= —'(Yi1 — Yoo) Vin — », (Yiz + Yor) Vi. (42b) 


Here the first term in each source represents mode coupling due to ad- 
mittance unbalance and the second term represents coupling to other 
conductors as defined in eq. (38). A corresponding set of current sources 
is obtained from eq. (37b) simply by replacing j with j + 1 in the above. 
The metallic circuit eqs. (39) can now be characterized by the equivalent 
circuit in Fig. 6b, and the associated longitudinal circuit eqs. (40) simi- 
larly in Fig. 6a. 

It should be noted that in general these two circuits are “coupled” via 
the dependent sources; that is, the dependent voltage and current sources 
for the metallic circuit require knowledge of Vz and I¢ from the longi- 
tudinal circuit, and vice versa. In certain situations, however, the cou- 
pling parameters are such that the excitation from a dependent source 
becomes independent of the other equivalent circuit. In this situation 
the metallic and longitudinal segment models are said to be “decoupled”; 
i.e., they function independently in the absence of termination unba- 
lances. Furthermore, it may even occur that mutual coupling parameters 
to the partitioned conductors 3, +--+, are such as to suppress excitation 
of a dependent source. To illustrate these points, consider the case of 
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Fig. 6-—Longitudinal-to-metallic conversion model. (a) Segment of a longitudinal circuit. 
(b) Segment of a metallic circuit. 


a twisted pair with small admittance and impedance unbalances enclosed 
within a conducting sheath. The twisting results in Zo. ~ Z4j, i.e., small 
impedance unbalance, and generally |J,,| « |J,| for some k = 3,---, 
M. This allows the first term in eq. (41b) to be dropped since, as the 
product of two small quantities, it is negligible compared to the sum- 
mation term. Similar reasoning applies to the first term in eq. (42b), 
which also can be dropped since the admittance unbalance is assumed 
small. Let us assume for illustrative purposes the remaining pair con- 
ductors are bonded to the sheath, and denote this aggregate collection 
as conductor 3. Then, within the summation, terms Yj; and Yo, are zero 
for k > 3, owing to capacitive shielding furnished by the enclosing sheath 
(conductor 3). Hence, the current sources can be removed from the 
longitudinal circuit model provided the sheath conductor is at ground 
potential, i.e., V3 = 0, while the voltage source becomes dependent only 
upon current flow in conductors 3 through M. This allows’ the longitu- 
dinal equivalent circuit to be solved first, subject only to various terminal 
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constraints (i.e., termination impedances). The results are then applied 
to the metallic equivalent circuit to assess longitudinal-to-metallic 
conversion, arising from coupling parameter and termination impedance 
unbalances. This simplified longitudinal circuit and the decoupled 
metallic circuit can be explored quite effectively to illustrate the longi- 
tudinal-to-metallic conversion process. 


IV. SUMMARY 


A systematic approach has been described for evolving lumped-ele- 
ment circuit models appropriate to low frequency interference analysis. 
By straightforward computer implementation one can reliably assess 
the intricate interaction of various physical mechanisms which occur 
in real life interference situations. Care has been taken to follow a strictly 
deductive approach in the modeling procedure. This ensures an accurate 
characterization of all relevant physical mechanisms, while preventing 
the occurrence of any extraneous modeling artifacts. 

The specialized circuit representations have highlighted the individual 
effects of coupling, shielding, and longitudinal-to-metallic conversion 
on telephone cable facilities. Their development systematically identifies 
all underlying assumptions and thereby offers clarification on the con- 
ditions which must be satisfied to permit confident reliance on these 
models. Three distinct types of coupling are identified: inductive, 
capacitive, and dissipative (the latter taking both longitudinal and 
transverse forms). Electric and magnetic types of shielding have been 
motivated as just wise utilizations of available coupling mechanisms. 
Finally, the longitudinal-to-metallic conversion model is unique in its 
ability to concisely characterize an arbitrary multi-conductor power and 
telephone environment. 

A carefully prepared glossary has been included as Appendix D. 
Particular attention is given to resolving jargonistic ambiguities by re- 
ferring back to fundamental principles. It is the author’s experience that 
unnecessary confusion generally results from the use of ‘“‘loose” termi- 
nology. In instances of conflicting historical usage, the present analytical 
framework is relied upon to furnish definitiveness. 
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APPENDIX A 


Perspective on Admittance Matrices 


Transverse coupling can occur either within an individual exposure 
segment or from mutual interaction within a terminal constraint, as 
indicated by the model for cascaded segments shown in Fig. 3. In all but 
a few actual cases, the terminal constraint phenomena are generally 
dominant by manifesting high admittances, Y/, compared to those for 
distributed effects in individual segments, Y/, and Y}. In these situations, 
it is acceptable to assume Y/, and Yj are zero for computational purposes. 
Basically, this implies that within a segment the radial flow of dis- 
placement current (associated with distributed capacitance) and of 
conduction current (associated with leakage conductance) are both 
negligible, either in themselves or with respect to current entering con- 
ductor nodes of the terminal constraint at location j. These conditions 
are typically met for nodes to which practical grounding configurations 
have been applied. However, for nodes at which grounding is either 
relatively poor or intentionally absent, all contributing admittances need 
to be retained. For instance, the conductance to ground and to other 
conductors may require consideration with direct burial noninsulated 
cable, such as concentric neutral power line. Conductance values can be 
obtained, either by means of direct measurement or by existing analytical 
techniques.® Moreover, the susceptance portion of Y and Yj, arising 
from self-capacitance and mutual capacitance effects may be important 
under either very high voltage or exceptionally low current situations. 
The former situation can exist in the proximity of EHV power trans- 
mission systems.® The latter low-current situation manifests itself in 
affecting actual open-circuit voltage-to-ground (or sheath) on pairs 
within long segments of telephone cable. 

Treating terminal constraint phenomena in the mathematical 
framework of admittance parameters is somewhat arbitrary, since a 
Thevenin representation might also have been chosen to characterize 
a general terminal constraint in terms of impedance parameters. When 
the terminal constraints are simply independent grounds, it is logical 
to consider the associated ground potential raise (GPR) a longitudinal 
impedance phenomenon as did Sunde,® since its primary effect is to 
restrict longitudinal current flow. On the other hand, when closely spaced 
nonindependent grounds interact strongly through mutual GPR effects, 
it is reasonable to view longitudinal currents as being partially excited 
through a localized transverse interaction with adjacent grounds acting 
as primary sources. Aside from whichever type of excitation actually 
prevails, it is computationally expedient to model mutual GPR interac- 
tion as a transverse admittance phenomenon, primarily because of the 
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Fig. 7—Relationship of longitudinal and metallic variables to earth-return variables. 


analytical simplification of combining parallel admittances through 
addition as illustrated in Fig. 3a. 


APPENDIX B 


Longitudinal and Metallic Reference Convention 


A portion of the general analysis pertaining to multiconductor seg- 
ments is modified by explicitly singling out two conductors for an al- 
ternate characterization in terms of the longitudinal and metallic ref- 
erence convention. This “change of variables” necessitates that a 
transformation be applied to the basic circuit relations. The modified 
equations for use with this alternate reference convention are summa- 
rized below. 

Let us identify conductors 1 and 2 for this new representation. (By 
appropriate choice of numbering, these two conductors might represent 
a twisted pair within a conducting sheath, or even more simply an open 
wire line. Alternatively, they could represent a phase and neutral con- 
ductor from a single-phase power system.) The basic approach is to de- 
fine four new variables, V,,, Ve, Im, Ig, in terms of the previous volt- 
ages-to-ground V,,V>2 and earth-return currents /1,/9 as follows: 


Vm = Vi -— Ve (metallic voltage), 
Vo = (Vi + Vo) (longitudinal voltage), 
Im = Ye — Ie) (metallic current), 
Ie=+Io (longitudinal current). (43) 


It may be helpful to note the last two current relations derive from 
Ty = Yole + In, 
Ig = Yole — In. (44) 


The relationships between old and new variables are illustrated in Fig. 
7. These new longitudinal and metallic variables differ quite notably 
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from the previous earth-return variables in that they encompass effects 
occurring on two conductors. Consequently, all four new variables 
quantify effects that are “shared” between conductors 1 and 2 as they 
function together. (It is interesting to note that standard noise measuring 
sets automatically record V¢ and V,,.) In summary, the foregoing change 
of variables can be compactly represented by transformations 7, and 
T; in the following matrix form 


Ve, a | : 
Ve % Wy 0 . 
Vr=|V;}=]0 0 1 | fe =(T,]V (45a) 
aes CR 
aos a 
Viz 0 0 0 | Sa 
and 
Im Y —" 01 ; 
ie ie 02 SOx 
Ir={i,|=]0 0 1 ie Olt Is” =(T;JI.  (45b) 
sie 
et 
ig 0 0 0 | ; 





The new transformed voltage and current column vectors are denoted 
as Vr and I7, respectively. Moreover, for ease of examination dashed 
lines are used to partition each matrix equation into regions having a 
particular form. Within the transformation matrices, the upper left-hand 
corner represents eqs. (43), whereas the lower right-hand corner pre- 
serves the earth-return reference convention for conductors 3 through 
M. 

Recall the transmission line eqs. (1) and (2) in their matrix form, such 
that they characterize the complete behavior within a multiconductor 
exposure segment. Interjecting the new transformed variables of eq. (45), 
the transmission line relations become 


dl 
= aoe = [T,YT5"]Vr=UrVr (46) 
and 
dV 
_ re _ [T, ZT) Ir = Zrir, (47) 
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where Yr and Zr are the new transformed incremental admittance and 
impedance matrices, as obtained from the indicated matrix operations. 
On the basis of reasoning identical to that preceding eqs. (10) and (11), 
a corresponding set of circuit relations is evolved in terms of the trans- 
formed variables. In particular, eq. (10) becomes eq. (48) shown on page 
1691, where the following notation has been introduced: 


Zm = 211 + Zo2 — 2Z 32 (metallic circuit impedance), 
Ze =%q(Z11 + Zoo + 2Zy2) (longitudinal circuit impedance), 
AZ = 211 — Zoe (impedance unbalance). 


Individual Z;, values are the matrix elements of Z//+! defined in eq. 
(10). The superscripts on Z}/*! and I//*", which identify the specific 
segment under consideration, have been omitted on the matrix and 
column vector elements in eq. (48) to minimize notational congestion. 
The transformed version of eqs. (11) follow simply upon noting eq. (49), 
shown on page 1691, where 


Ym = Ya(Y11 + Yoo — 2Y12) (metallic circuit admittance), 
Ye = Yu. + Yoo + 2Yy0 (longitudinal circuit admittance), 


AY = Yy1 — Yoo (admittance unbalance). 


Both Z#*! and Yi#*! are symmetric matrices since all Zj, = Zpi and 
Yin = Y,; from reciprocity. 


APPENDIX C 


Circuit Model Frequency Restrictions 


It is worthwhile from both an applicational and theoretical point of 
view to clarify the frequency range over which the equivalent circuit 
models are applicable. Recall that in starting from the transmission line 
eqs. (1) and (2), and going to the lumped-element circuit theory eqs. (10) 
and (11), an electrically short segment of the loop plant was specifically 
considered. The meaning of this earlier assumption is now examined in 
some detail. 

Note the electrically short assumption is likewise implicit in arriving 
at the metallic and longitudinal circuit eqs. (39) and (40), to which the 
equivalent circuit models of Fig. 6 apply. The initial discussion will focus 
upon these two models and later be expanded to include all conductors 
within an exposure segment. Whereas these circuit models are exact 
characterizations of the lumped-element circuit equations, it is impor- 
tant to know how well they approximate the behavior associated with 
the original transmission line differential equations. Fortunately, this 
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Fig. 8—Equivalent network for a general transmission line. 


question is rather easily resolved, since an exact pi-network model is 
available for the general transmission line case.!° Specifically, the exact 
behavior of the terminal voltage and terminal current is characterized 
by the equivalent network shown in Fig. 8. The particular choice of 
voltage and current variables defines a form of “mode” which will be 
designated as n. Here, n = £ or m, for either the longitudinal or metallic 
type mode response of the transmission line differential equations. The 
characteristic impedance for mode n is defined by 


Zon = 2, (50) 

| ¥; 

and the all-important electrical length is 
Ynl =VZnYn> (51) 


where the Z2,Z,, and Y2,Y were specified in Appendix B by eqs. (48) 
and (49), respectively, for a segment length of A@ = 2. 

The general transmission line equivalent network of Fig. 8 reduces 
to the impedances and admittances within the circuit models of Fig. 6 
for sufficiently small values of y, 2. To see this, note that 


sinh yn, > yn (52a) 
and 
‘anh es anf (52b) 


for |yn@| « 1. Hence, for the series impedance branch of the pi-net- 
work, 


Z1 = Zon sinh ynl — ZonYné = 22 VZn¥n = Zn3 (53) 
for the two shunt admittance eo 


Yi = 5 tan nh 228 wt VEY, [ 24/B=%, (54) 


on 
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where n = @ or m. The restriction |y,¢| « 1 can be relaxed substantially 
without incurring appreciable error. For instance, with |y,@| < 0.5, the 
circuit elements of Fig. 6 will be within 5 percent of their true values as 
given by transmission line theory in Fig. 8. This condition serves as a 
practical guide in deciding the upper frequency limit for which just one 
segment of the longitudinal and metallic circuit models furnish rea- 
sonable accuracy. 

A few observations of more general nature will now be made. The 
metallic and longitudinal variable equivalent circuits derived previously 
focus attention upon the circuit variables and parameters contained in 
the upper part of the partitioned eqs. (48) and (49). Dependent sources 
were utilized to account for the coupling to other conductors within the 
segment. One can similarly focus attention upon the lower part of the 
partitioned eqs. (48) and (49). In this case, each distinct conductor will 
have just a single earth-return equivalent circuit, corresponding to that 
shown in Fig. 4. Moreover, each conductor will give rise to one distinct 
mode n, wherein the impedance and admittance parameters of the 
transmission line solution and lumped-element circuit solution relate 
as Zn = Znn, Yn = Ynn, n = 3, -++, M. Hence, we see that every conductor 
contained in the segment has associated with it an electrical length, y, 2, 
the largest of which will be denoted as yy@. Evidently a segment is 
electrically short, and thus characterizable by a correspondingly simple 
circuit model, whenever |y1@| is sufficiently small in the sense discussed 
above. The presence of dependent sources does not alter this conclusion, 
since they too characterize only electrically short behavior. That is, the 
sources are composed of mutual admittance and mutual impedance 
terms, for which it can be shown that | Yiz| < | Yz2| and |Ziz| < |Zzx| 
for all k (or n) of interest. Hence, if the self-admittance and self-im- 
pedance terms, Y,,, and Zyn, satisfy an electrically short criterion, so too 
must the mutual terms. 

It should be stated that these modes and associated propagation 
constants, y,, are somewhat unorthodox; they differ from the customary 
“eigenmodes” of the M-conductor system. Kigenmodes are ordinarily 
defined# from the n eigenvalue roots, 72, of the matrix equation | YZ — 
y2U| = 0, where U is a unity matrix. Each distinct root then constitutes 
two eigenmodes, e+¥n?, having the property that they can propagate 
independently without undergoing distortion. To the extent the largest 
Yn = max yp can be easily estimated without extensive numerical pro- 
cedures, this more orthodox approach furnishes a completely rigorous 
method for testing a multiconductor segment of length @ to determine 
compliance with being electrically short. The values of ,, associated with 
the eigenmode approach differ from those described above in accordance 
with the levels of mutual coupling, as characterized by the dependent 
sources in the circuit models of Figs. 4 and 6. 
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APPENDIX D 
Glossary 


Conduction 
Current 


Conductive 
Coupling 


Controlled 
Sources 


Dependent 
Sources 

Displacement 
Current 


Dissipative 
Coupling 
Earth-Return 
Reference 

Convention 


Effective 
Mutual 
Impedance 


Electric (or 
Capacitive) 
Coupling 

Electromagnetic 
Coupling 


External 
Impedance 

Internal 
Impedance 
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The current flow that is associated solely with 
the finite conductivity, oc, or resistivity, p, of a 
medium; e.g., the current flow through a 
resistance. 

The contribution to transverse coupling arising 
from Ohm’s law and the flow of conduction 
current. 

Current and voltage generators (sources) 
controlled by voltage and current signals, 
respectively. Since controlled sources are 
dependent on a control signal, they are often 
referred to as dependent sources. 

See controlled sources. 


The current flow associated solely with the 
permittivity, «, of a medium; e.g., the current 
flow through a capacitance. 

A general term encompassing the physical 
mechanism of resistive or conductive coupling. 

A voltage variable whose reference potential is 
that of remote ground; a current variable 
whose return path is assumed to be through 
the ground. 

The ratio of the total induced open-circuit 
voltage on the disturbed circuit to the 
disturbing power system phase current with 
the effects of all conductors taken into account. 

The contribution to transverse coupling arising 
from Gauss’ law of electric induction and the 
flow of displacement current. 

A general term encompassing primarily electric 
and magnetic coupling and, in principle, 
dissipative coupling too. 

The (total) longitudinal impedance less that 
contribution due to internal impedance. 

The sum of the conductor resistance and internal 
reactance (from the magnetic field inside the 
conductor). 
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Longitudinal 
and Metallic 
Reference 
Convention 

Longitudinal 
Circuit 


Longitudinal 
Coupling 


Longitudinal 
Current 


Longitudinal 
Impedance 


Longitudinal 
Input 
Impedance 

Longitudinal 
Voltage 


Magnetic (or 
Inductive) 
Coupling 

Metallic 
Circuit 

Mutual 
Coupling 


Phasor 


Voltage and current variables that are related to 
the earth-return reference convention as shown 
in eq. (48). 


A circuit utilizing longitudinal reference 
conventions for the voltage and current 
variables. 

The force exerted on a charge by an electric field 
in a longitudinal or axial direction of a 
conductor. 

The directed current flow along the axis of a 
conductor; this flow may be measured using 
either earth-return or longitudinal reference 
conventions. 

A quantitative measure of longitudinal coupling 
within a segment, accounting for both 
inductive and resistive types of coupling 
(dependent upon chosen reference 
convention). 


The Thevenin impedance looking into the 
longitudinal circuit formed by the wire pair 
and terminating impedances. 

The change in voltage occurring along the axis of 
a conductor; this change may be measured 
using either longitudinal or earth-return 
reference conventions. (Also the voltage 
variable in the longitudinal reference 
convention). 

The contribution to longitudinal coupling arising 
from Faraday’s law of magnetic induction and 
the flow of conduction current. 

A circuit utilizing metallic reference conventions 
for the voltage and current variables. 

A transverse admittance or longitudinal 
impedance which relates the stimulus on one 
conductor to its response upon another 
conductor. 

A harmonically time-dependent complex 
number. Phasors are added, subtracted, 
multiplied, and divided in the same way as 
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Power Line 
Balance 


Power Line 
Balance 
Factor 


Primary 
Voltage 


Remnant 
Voltage 


Remote 
Ground 
Potential 

Residual 
Current 

Resistive 
Coupling 

Self-Reaction 


Shield 
Factor 


Skin Depth 
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complex numbers. Often phasor additions are 
referred to as vector additions. 

The relative absence of residual current at a 
particular frequency for a group of power line 
conductors; balance at one frequency, in 
general, does not imply balance at another 
(harmonic) frequency. 

The ratio of the sum of the phase current 
magnitudes divided by the residual current 
magnitude; this quantity or its logarithm is a 
figure of merit for the degree of balance of a 
power line at a particular frequency. 

The voltage induced in the disturbed circuit by 
current in the disturbing circuit, in the absence 
of the intended shielding circuit. 

The voltage induced in the disturbed circuit by 
current in the disturbing circuit with the 
intended shielding circuit present. 

The potential of a region several skin-depths into 
the earth immediately beneath some specified 
conductor location. This reference potential is 
usually assumed to equal zero. 

The instantaneous or phasor sum of the 
conductor currents for a group of power line 
conductors; this current returns to the source 
through a path other than those conductors. 

The contribution to longitudinal coupling arising 
from Ohm’s law and the flow of conduction 
current. 

A transverse admittance or longitudinal 
impedance which relates the stimulus to its 
associated response upon the same conductor. 

The ratio of remnant voltage to primary voltage 
for a constant value of disturbing current; a 
measure of shielding effectiveness, i.e., the 
smaller the ratio or shield factor, the better the 
shield in reducing remnant voltage. 

The depth into a conductor at which the 
magnitude of the electromagnetic field is 
reduced to approximately 1/e of its surface 
value; its thickness is given in terms of 
resistivity, radian frequency, and permeability 
as 6 = V2p/wy . 
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Skin Effect 


Transverse 
Admittance 


Transverse 
Coupling 
Transverse 


Current 


REFERENCES 


A phenomenon associated with time-varying 
fields which tends to concentrate currents 
toward the surface of conductors that are 
nearest to the field sources which produce the 
currents. The skin-depth is a measure of this 
effect. 

A quantitative measure of transverse coupling, 
accounting for both capacitive and conductive 
types of coupling (dependent upon chosen 
reference convention). 

The force exerted on a charge by an electric field 
in a direction transverse or perpendicular to 
the axis of a conductor. 

The outward current flow perpendicular to the 
axis of a conductor which accompanies a 
decrease in longitudinal current. 
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Idle Channel Noise Suppression by Relaxation 
of Binary ADM-Encoded Speech 


By S. V. AHAMED 


(Manuscript received November 3, 1977) 


Techniques of identifying the location of silence periods in the binary 
data of prerecorded telephone messages from Adaptive Delta Modu- 
lation (ADM) encoders are discussed. Two algorithms for detecting and 
replacing such periods by absolute silence are investigated. In the first 
method one or two words (each 16 bits long) spanning one or two msecs, 
respectively, at a 16 kHz ADM-sampling frequency, are searched. They 
are either replaced or not in their entirety by a perfect silence sequence 
based upon a computed decision. In the second method, blocks of any 
prespecified number (1 to 512) of words spanning 1 to 512 msec at 16 
kHz are searched to detect the authenticity of a silence period within 
a smaller block (typically between 1 to 31 words) spanning 1 to 31 msec 
embedded within the larger block. The first method has yielded un- 
satisfactory results, and the second method with larger window scan 
of 80-200 msecs and a smaller window duration of 1 to 5 msec has 
yielded nearly perfect results by eliminating all of the idle channel 
noise without degrading the quality of the message. This technique of 
locating the silence periods in ADM encoded speech is compared with 
that for ADPCM encoded speech. Also the optimized parameters (for 
making the computed decision) are shown to be satisfactory for pre- 
venting false silence clues for genuine speech data over the wide vari- 
ations of frequencies and amplitudes. Further, these parameters may 
be converted for different clock rates,and some are traced back to the 
specifics of the encoder and the others to the character of speech. 


I. INTRODUCTION 


The source of idle channel noise in most ADM (Adaptive Delta Mod- 
ulation) speech encoders is the nonuniformity of the data stream gen- 
erated by the encoder during silence periods. From a relaxation* study 


* In the context of this paper, relaxation implies changing the binary data to a state such 
that any further change will not yield an improvement in the quality of the speech or of 
silence. 
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on the computer it becomes obvious that if the decoder is forced into a 
repetitive input bit pattern synchronized with the main ADM clock, the 
decoder idle channel noise can be suppressed. The decoder, responding 
only to the incoming data stream, cannot generate any noise on itsown, 
and a perfect encoder, to assure silence, would provide a repetitive 
pattern to the decoder. In the absence of such an encoder, it is possible 
to determine the silence periods during speech by a computed decision 
and force the bit pattern to be repetitive during such periods. 

The silence periods are particularly conspicuous in exposing the im- 
perfection of the encoder. Being devoid of any meaningful information, 
they can become annoying if the encoder does not produce a bit pattern 
which forces a complete silence for the decoder. For instance, the most 
commonly used bit pattern is 0101. . ., even though other patterns such 
as 001100. . ., 01001101. . ., etc., all yield a type of semisilence or humming 
silence with intertwined frequencies. The nature of decoder silence is 
also influenced by the companding algorithm and the output filter 
characteristics. In this application we have obtained the best silence by 
a sequence of 0101... which offers two distinct advantages: 

(t) The compandor having been rendered inoperative by this se- 
quence, the step size decays to its minimum value. 

(it) The frequency generated by this sequence, being half the clock 
rate, is beyond the bandwidth of the audio filter. 


ll. SILENCE CLUES 
2.1 Cluster and transitions clues for ADM data 


When the only available data is the encoded binary stream from an 
ADM encoder, three clues to judge the authenticity of the silence period 
may be used: (7) the absence of at least one cluster of ones whose width 
exceeds that of a prechosen minimum threshold cluster, (ii) the absence 
of at least one cluster of zeros whose width exceeds that of a prechosen 
minimum threshold cluster, and (tii) the excess of transitions between 
zeros and ones and vice versa over a prechosen number of transitions 
during a prechosen interval of time. The first two clues together have 
been termed “cluster clues,” and the last one is termed “transitions clue.” 
There are thus five variables necessary to implement the scheme: (a) 
the number of ones in the minimum threshold cluster of ones, (b) the 
number of zeros in the minimum threshold clusters of zeros, (c) the 
number of transitions in the duration to seek the clues, (d) the duration 
of time to seek the clues and finally, (e) the duration for substituting the 
ideal silence sequence 0101. . . instead of the imperfect silence data from 
the encoder. 
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2.2 Code word energy clue for ADPCM data 


In Ref. 1, the concept of “code word energy” which is defined as an 
integrated energy for 16 msec (8 msec forward, 8 msec backward) around 
a preselected code word of ADPCM data is used. The concept has been 
used to locate the beginning and end of utterances. Individual discrete 
code words which convey one of the sixteen levels of amplitude infor- 
mation of the speech wave are scanned for amplitude deviation from a 
mean value. If the code word energy consistently exceeds that obtained 
during silence periods at the ADPCM encoder for 50 msec, then the be- 
ginning of the utterance is traced back to the instant at which the code 
word energy first started to exceed the threshold. Conversely, if the code 
word energy falls below that recorded during speech consistently for 160 
msec, the end of the utterance is traced back to the instant at which the 
code energy first receded from the threshold. 


2.3 Differences between clues for ADPCM and ADM data 


Code word energy clue is well suited for ADPCM data where amplitude 
information is discretely coded in each word. For ADM data, only the sign 
information is conveyed to the decoder and there is no distinguishable 
boundary between any neighboring bits. For this reason, the concept of 
code word energy is inapplicable for ADM-encoded data. The cluster clues 
contain the information that not once during the search interval did the 
encoder experience an unidirectional change in amplitude lasting for 
a minimum number of clock cycles. The transitions clue contains the 
information that the frequency of change of direction of the output from 
the decoder exceeds what one would expect it to be during low level 
speech signals, and very close to what one would expect during the silence 
periods. The cluster clue is based on the fact that speech contains the 
dominant portion of energy in lower formant frequencies and the tran- 
sition clue is based on the fact that speech energy concentration tapers 
off at higher formant frequencies. 

Further, the computed decision for ADPCM data is a process of only 
looking forward in time to determine the increase of energy over the si- 
lence threshold energy (for locating the beginning of an utterance) or 
the decrease of energy below that of the speech threshold energy (for 
locating the end of the utterance). In ADM data we have found it neessary 
to look forward and backward for clues to make a computed decision 
regarding the authenticity of silence during the intermediate interval. 
As shown in the following sections, if the five variables [(a) through (e) 
in Section 2.1] on which the concept is implemented, are carefully chosen, 
the cluster and transitions clues work very dependably. 
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lll. IMPLEMENTATIONAL DETAILS 


Implementing the concept has been attempted by two distinct tech- 
niques: (i) word relaxation schemes and (ii) block relaxation schemes. 
In word relaxation schemes the duration for seeking the cluster and 
transitions clues [i.e., (d) in Section 2.1] is made the same as the duration 
for replacing the ideal silence sequence 0101. . . instead of the imperfect 
silence data [i.e., (e) in Section 2.1]. In block relaxation schemes the 
duration for replacement has been made 2.3 to 13 percent of the duration 
for searching for the clues to judge the authenticity of the silence. 


3.1 Word relaxation schemes 


In this method the durations of search and replacement have both 
been made typically 1 or 2 msec. The width of the minimum threshold 
cluster for ones (hereafter referred to as “ones cluster threshold’’) and 
the width of the minimum threshold cluster for zeros (hereafter referred 
to as “zeros cluster threshold”) are both typically set at two. The 
threshold for the transitions (hereafter referred to as ‘‘transitions 
threshold”) is typically set at ten (16 kHz clock rate) for a one msec 
search and replacement window. The concept has been programmed in 
assembly language on a minicomputer in a conversational mode of input 
and output. Large blocks of data have been processed. The results have 
been analyzed both by subjective tests and oscillographic studies. 
Whenever the computer does replace the imperfect silence by a perfect 
silence the replaced data generates a limit cycle with respect to the input 
frequency and it appears as a stationary pattern on an oscilloscope. The 
scheme, even though successful about 90 percent of the time, fails to 
satisfy a perceptive listener during the silence period. When the three 
threshold parameters are made (3,3,8) respectively,* the quality of the 
silence becomes perfect but the speech tends to become slightly choppy 
and a happy compromise of the three threshold parameters has not 
proved possible. 

Similar results have been encountered with two msec duration for 
seeking the clues and replacing the silence bit-pattern. A slight im- 
provement gained in the quality of silence with perfect speech still does 
not satisfy a perceptive listener, and for this reason the block relaxation 
procedure has been developed. 


3.2 Block relaxation schemes 


The limited performance of the word relaxation scheme is due to a very 
narrow slice of time used to determine whether a word contains silence 


* These parameters are always written in the same order: ONES, ZEROS, TRANSI- 
TIONS. 
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Fig. 1—Data management during the detection and replacement of silence periods. a 
to d: Memory buffer refilled from the disk after processing the middle 256 words such that 
b is reread at a, c at b, etc. b toc: Limits of excursion of the small window and storage of 
processed data. 


or genuine data. The block relaxation scheme expands the search for 
clues over a much larger interval (large window) to compute the decision 
for a much shorter interval (small window, see Fig. 1). Typical durations 
for the large and small windows are from 39 to 131 msec and from 1 to 
5 msecs, respectively. The clusters and transitions clues are verified by 
the same method discussed in Section 2.1. The large window is defined 
during program execution by two parameters: (1) the duration for which 
the search for the clues should start prior to the beginning of the small 
window and (ii) the duration for which the search for the clues should 
continue after the end of the small window. Parameters (1) and (it) have 
been made distinct because the choppiness of plosives, fricatives, siblants 
and nasals at the ends of the utterances can be eliminated by making (7) 
longer than (ii) while simultaneously eliminating the slightest crackle 
before an utterance. The effects of changing the durations of the large 
window, the small window and the threshold parameters are tabulated 
in Table I. 


IV. DISCUSSION OF RESULTS 


Dependable determination of the silence periods and complete sup- 
pression of idle channel noise is possible with the right value of the 
threshold parameters and window sizes. On one hand if the threshold 
parameters are (1,1,16) for ADM data at 16 kHz with a large window and 
small window of 1 msec each, then the block relaxation scheme becomes 
identical to the word relaxation scheme and no change is effected be- 
tween the original data and the processed data. On the other hand, if the 
threshold parameters are made (16,16,0) with one msec small and large 
window, all the data is changed to that of a perfect silence. Other vari- 
ations of threshold parameters and window widths, if not all optimal, 
may fail one way (by retaining slight choppiness in the words) or the 
other (by imperfect idle channel noise suppression). Hence the search 
for optimal threshold parameters and window widths has led us to a 
minimum cluster threshold parameter of three* and to a transition 


* This defines that the presence of an “instantaneous” frequency generated with a cluster 
whose width exceeds three (i.e., 2 kHz and below at 16 kHz clock rate) within the large 
window qualifies the small window to be classified as genuine speech data. 
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Table | — Control parameters and quality 


Durations in msec 


Small Threshold 
File win- parameters Tyansi- Remarks, 
Name Before dow After Ones Zeros tions subjective 

A 3 3 3 3 3 80 Noisy silence, message OK. 

B 32 17 32 2 2 896 Noisy silence, message OK. 

C 3 3 3 3 3 88 Slightly noisy silence, slightly 
choppy words. 

D 3 3 3 3 3 72 Perfect silence, choppy words. 

T 64 3 64 3 3 1856 Noisy silence, message OK. 

U 64 3 64 3 3 1664 Perfect silence, message OK. 

V 64 3 64 3 3 1536 Perfect silence, very slight 
choppiness of words. 

P 64 3 64 3 3 1728 Trace of crackle in the silence 
periods, message OK. 

Y 18 3 18 3 3 480 Very slight crackle, very slight 
choppiness. 

Z 18 3 18 3 3 512 One crackle in 8 sec of silence, 
message OK. 

‘Pp* 18 3 18 3 3 480 One crackle in 16.2 sec of silence. 

O* 18 3 18 3 3 544 Three crackles in 16.2 sec of silence. 


Notes: 1. The clock frequency is 16 kHz. 

2. The message is a typical standard telephone announcement. 

3. Large window width is sum of three columns Before, Small window and After. 
* The imperfect silence of the encoder is processed here. 


threshold parameter of about 0.783? times the maximum number of 
transitions that can occur in the large window; the large and small win- 
dows themselves being 100 to 131 and 1 to 5 msecs respectively. This 
transition threshold parameter is consistent with our estimation of the 
frequency of 01 or 10 transitions and 00 or 11 and 000 or 111 cluster 
formations during silence periods. Sixty percent of the time the encoder 
generates a 01 or 10 transition, thirty percent of the time the 00 or 11 
cluster appears and ten percent of the time the 000 or 111 cluster is 
generated. These statistics yield the transitions threshold parameter 
as 0.783! times the maximum transitions one may expect. 

Computationally optimized values of the transitions threshold verify 
the result. For instance, in case of File U (Table I) processed with a large 
window width of 131 msec, the maximum number of transitions possible 
are 2096 at a clock rate of 16 kHz and the estimated transitions for the 
silence period are thus 1634. The computationally optimized value is 
1664. A similar assertion of the threshold parameters may be made for 
File P (Table I) with an estimated value at 489 transitions and a com- 
putationally optimized value of 480 transitions. 


t This defines that if the average frequency generated in the large window is above 6.26 
kHz (i.e., 0.783 X 8 kHz), then in the absence of the cluster clues, the data in the small 
window is the imperfect silence of the encoder. 


t 0.783 = 0.6 + 0.3/2 + 0.1/3. 
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V. RELATION BETWEEN OPTIMIZED PARAMETERS AND DATA ON 
SPEECH-SILENCE STATISTICS 

The computationally optimized control parameters are consistent with 
published speech statistics. 

Low level consonant sounds (some plosives “p”, “‘t’’ and fricative “‘6’’), 
which are short (80-50 msec) are not mistaken as silence because of 
neighboring higher level signals which precede or follow them. The 
cluster clues in the large window (up to 131 msecs wide) fail to be 
triggered. Longer duration (up to 230 msecs) low level consonant sounds 
especially sibilant ‘“‘s” and fricative “z” may tend to trigger the cluster 
clues but fail to trigger the transitions clue because of the larger value 
of the transitions threshold parameter. Other plosives, fricatives, sibi- 
lant-fricatives, nasals and semivowels lasting between 80 to 200 msec 
falling between the two earlier cases fail to trigger either the cluster clue 
(because of neighboring clusters in the large window) at the lower limit 
of 80 msec or the transitions clue (because of high value of transitions 
expected in the large window) at the higher limit. The fricative “f” and 
the semivowel “r” which have some spectral energy at about 3500 Hz and 
also last between 200 and 160 msecs, still fail the transitions clue since 
the average limit for the frequency of the imperfect encoder silence is 
about 6.26 kHz. Higher level consonants and key word vowels fail to 
trigger either cluster clue or the transitions clue because of their higher 
amplitude. As has been determined by the variation of the large window 
width on the computer, the results obtained by longer larger windows 
are more satisfactory than those from much shorter ones. 

The nature of the compandor (syllabic or instantaneous) also plays 
arole here. Consider a syllabic compandor whose encoded data is being 
scanned by a large window width of about 70 msec and a long sibilant 
or fricative that is at the end of a word. If the step size is large due to 
earlier letters in the word, then a false trigger of both the cluster and 
transitions clues is possible. Such a process yields a choppy ending of 
the word. However, a larger window of about 300 msec will eliminate this 
condition. Conversely, data from an instantaneous compandor would 
fail to trigger either of the two clues even with a 70 msec large win- 
dow. 

Further, we have noticed that an exact central spacing of the smaller 
window in the larger window is less desirable than spacing the smaller 
window towards the end (shifted 10-15 percent). Such a spacing has two 
advantages: (i) false clues by long sibilants and fricatives at the end of 
words (when the step size of the ADM encoder is already large due to 
earlier letters in the word) are eliminated and (ti) crackles in the silence 
periods located just at the beginning of words are correctly eliminated 
(because the step size is low due to the preceding silence, a cluster is most 
likely formed at the beginning of the utterance). A similar observation 
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has also been made in Reference 1 while detecting the end of an utter- 
ance. The critical duration after which code word energy falls below the 
threshold is 160 msec as against 50 msec for the energy to be above the 
threshold while detecting the beginning of the utterance. Large windows, 
inconsistently long in relation to the type of companding of the encoder 
face the risk of imperfectly suppressing the idle channel noise, and 
conversely large windows too narrow for the type of companding in the 
encoder face the risk of leaving behind traces of choppiness in the 
words. 

From the study presented in the paper for the data from an ADM en- 
coder* with an attack time constant of 3 msec and a decay time constant 
of about 9 msec, the large window should be about 120-150 msec en- 
compassing a small window of 3-5 msec located at 60 percent from the 
start of the large window duration. The cluster threshold constants are 
each 3 and the transitions threshold is approximately 0.783 times the 
maximum number of transitions possible in the large window. 


Vi. CONCLUSIONS 


Idle channel noise may be virtually eliminated from stored ADM 
messages by the block relaxation techniques presented in this paper. 
With a proper selection of the control parameters, the silence periods can 
be accurately located and replaced by a perfect silence. At very low clock 
rates (about 16 kHz) where the compromise between idle channel noise 
and intelligibility is very real, the intelligibility can be enhanced by re- 
cording with a low step size, which impliesa disturbing component of the 
idle channel noise. This can be completely suppressed, however, by block 
relaxation techniques. 

The flexibility offered by this technique also permits the detection 
of silence over a wide range of clock frequencies and amplitudes of the 
recording message. Typically it is necessary to scan about 70-2007 milli- 
seconds before judging the authenticity of a silence period and to base 
the judgment on the number of transitions and consecutive ones and 
zeros in that interval of time. 
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